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2  Introduction 


Three  years  ago,  we  initiated  an  ambitious,  long-term  effort  to  develop  a  generic 
architecture  for  large-scale  distributed  real-time  situation  assessment  systems  (DSA). 
Significant  progress  has  been  made  toward  this  goal  through  the  development  of  the  major 
components  that  are  necessary  for  such  an  architecture  and  through  the  investigation  of 
approaches  for  formally  analyzing  DSA  systems  and  applications. 

Research  into  a  generic  agent  architecture  for  real-time  DSA  holds  much  promise  because  it 
would  greatly  facilitate  the  development  of  a  variety  of  military  and  commercial  applications 
of  DSA  and  related  systems.  Examples  of  such  applications  include:  distributed  sensor 
networks,  distributed  network  diagnosis,  distributed  information  retrieval  and  digital 
libraries,  distributed  perceptual  processing  for  cooperating  robots/autonomous  vehicles, 
and  computer-supported  cooperative  work  on  situation  analysis  tasks.*  DSA  will  also  be  an 
integral  component  of  many  distributed  planning  and  scheduling  systems. 

We  see  these  kinds  of  applications  emerging  much  sooner  than  we  had  anticipated  three 
years  ago  as  a  result  of  the  quickening  development  of  the  NIL  Recently,  there  has  been 
significant  effort  directed  toward  building  the  hardware/software  infrastructure  that  makes 
these  kinds  of  applications  possible  because  it  allows  large  numbers  of  intelligent  agents  to 
interact  over  both  local  and  long-distance  networks  seamlessly.  An  example  of  such 
emerging  software  infrastructure  is  the  Telescript  architecture  from  General  Magic  and 
high-level  protocols  such  as  KQML. 

Despite  these  infrastructure  advances,  there  remain  many  software-related  questions  that 
must  be  answered  in  order  to  develop  large-scale  distributed  situation  assessment  systems. 
We  see  the  key  issues  as  how  to  organize  both  local  agent  and  network-wide  problem 
solving  so  that  the  agents  can  cooperate  effectively  to  produce  answers  of  appropriate 
quality  within  fixed  deadlines,  using  limited  communication  bandwidth,  and  have  their 
performance  degrade  gracefully  as  sensors,  communication  links,  and  processors  fail. 

To  address  these  issues  we  have  been  investigating  the  use  of  the  DRESUN  system  as  the 
basis  for  a  generic  architecture  for  distributed  real-time  situation  assessment.  This 
architecture  was  chosen  because  our  earlier  research  suggested  that  sophisticated,  self- 
aware  agents  were  required  to  support  the  flexible  and*  dynamic  problem-solving  strategies 
that  are  necessary  in  complex  DSA  tasks.  The  major  areas  of  effort  in  this  research  to  date 
have  been  in: 

•  creating  a  highly  flexible  testbed,  with  agents  that  can  support  very  wide  range  of 
interaction  protocols; 

•  extending  the  capabilities  of  the  agents  to  model  the  beliefs  of  other  to  facilitate  the 
communication  and  use  of  incomplete  information  information  at  multiple  levels  of 
abstraction; 

•  developing  appropriate  coordination  algorithms/protocols; 

•  integrating  real-time  deadlines  into  the  control  framework. 

We  have  also  been  pursuing  several  different  techniques  for  formally  characterizing/ 
analyzing  important  aspects  of  DSA  systems  and  applications.  We  have  made  significant 
progress  on  these  areas. 


'  It  is  this  last  application  for  which  we  see  a  near-term  use  for  our  previous  research  results,  and  for  which 
we  are  proposing  an  optional  technology  demonstration  project. 
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Figure  1;  Our  model  of  agents  for  distributed  real-time  situation  assessment. 


1  igure  2.  Embedding  a  DSA  system  in  a  complex  distributed  problem-solving  system. 
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Our  recent  work  has  emphasized  the  development  of  an  agent  model  that  can  support 
sophisticated  coordination  strategies  in  the  context  of  real-time  constraints.  This  model  is 
shown  in  Figure  1.  The  task  database  maintains  a  representation  of  the  local  agent's  tasks 
and  the  tasks  of  other  agents,  with  information  about  the  relations  among  the  tasks.  Local 
task  information  is  provided  by  the  interpretation  system  module,  which  is  also  reponsible 
for  low-level  control  decisions,  or  when  there  are  no  relevant  task  relationships 
(subproblem  interactions)  or  real-time  constraints.  When  there  are  relevant  task 
relationships  and  real-time  constraints,  the  real-time  scheduler  uses  the  task  information  and 
interacts  with  the  interpretation  module  to  select  appropriate  tasks/actions.  We  are  currently 
working  on  the  integration  of  the  interpretation  module  with  the  other  modules  and  expect 
the  further  development  of  this  model  will  be  an  important  focus  of  continued  research. 

In  addition  to  the  progress  that  we  have  made  developing  the  key  elements  of  a  generic 
DSA  architecture,  many  of  the  issues  that  we  are  exploring  are  of  general  interest  for 
research  on  cooperative  distributed  problem-solving  systems.  For  example,  a  key  issue  we 
have  had  to  address  is  the  bounded  rationality  of  the  agents,  which  results  from  limited 
computational,  communications,  and  sensing  resources.  This  is  important  because  it  affects 
both  the  satisficing  control  strategies  and  the  agent  coordination  strategies  by  introducing 
interpretation  and  control  uncertainty  since  agents  are  inherently  limited  to  incomplete  and 
possibly  inconsistent  views  of  the  situation  and  the  problem-solving  activities  of  other 
agents.  Another  key  issue  our  research  is  addressing  is  how  to  deal  with  interacting 
subproblems  in  sophisticated  knowledge-based  systems.  The  formal  analysis  work  that  we 
have  done  is  beginning  to  give  us  the  tools  to  understand  how  complex  problem-solving 
architectures  interact  with  the  characteristics  of  particular  domains. 

The  next  section  gives  an  overview  of  the  key  research  issues  that  must  be  addressed  in 
complex  DSA  systems  and  outlines  our  basic  research  approach.  This  is  followed  by  a 
section  that  examines  the  key  areas  in  this  research  that  are  of  general  interest — ^i.e.,  that  are 
applicable  beyond  DSA.  A  summary  of  the  progress  we  have  made  so  far  is  given  in 
Section  5. 

3  Overview  of  the  Key  Issues  in  this  Research 

The  key  issue  in  building  complex  DSA  systems  is  hQw  to  organize  local  problem  solving 
in  each  agent  (node)  so  that  the  agents  can  cooperate  to  achieve  globally  coherent  behavior 
—  i.e.,  so  that  they  can  work  together  effectively  to  get  a  consistent  and  accurate 
assessment  of  the  entire  situation.  Coherent  distributed  problem  solving  means,  in  part,  that 
the  system  must  have  the  ability  to  produce  answers  of  appropriate  quality,  in  a  timely 
manner,  using  limited  communication  bandwidth.  It  is  also  important  that  performance 
degrades  gracefully  as  processors,  sensors,  and  communication  links  fail.  Coherence  is  a 
difficult  issue  because  each  agent  has  a  limited  viewpoint  on  the  developing  solution,  the 
available  data,  and  the  problem-solving  activities  of  the  other  agents.  This  can  result  in 
agents  doing  redundant  work,  failing  to  produce  results  that  are  necessary  for  other  agents 
to  proceed,  communicating  unnecessarily,  etc. 

The  main  source  of  these  difficulties  is  the  issue  of  subproblem  interactions — i.e.,  the 
agents'  local  solutions  are  often  not  independent.  This  occurs  whenever  data  (evidence)  for 
an  interpretation  hypothesis  is  spread  among  multiple  agents  or  when  agent  interest  areas 
overlap  as  a  result  of  overlapping  sensor  coverage.  When  agent  subproWems  interact,  the 
construction  of  a  global  solution  may  not  be  straightforward — e.g.,  local  agent 
interpretations  may  be  inconsistent  because  they  are  based  on  different  incomplete  subsets 
of  the  data.  In  such  situations,  significant  agent  interactions  may  be  required  if  the  globally 
most  likely  interpretations  are  to  be  determined.  However,  both  local  problem  solving  and 
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inter-agent  communications  must  be  carefully  controlled  to  avoid  excessive  communication 
costs,  delays,  and  redundant  work. 


Figure  3:  An  example  of  inconsistent  local  interpretations. 

The  application  is  vehicle  monitoring.  Agent  A  and  Agent  B  receive  data  only  from  their  own  individual 
sensors,  whose  coverage  regions  overlap.  Agent  A’s  data  is  represented  by  squares  and  agent  B’s  by  circles, 
with  positions  as  indicated  at  the  times  denoted  by  the  associated  numbers.  The  grey  density  of  the  data 
points  corresponds  to  the  relative  “quality”  of  the  data — i.e.,  the  a  priori  likelihood  that  the  data  would  have 
resulted  from  a  real  vehicle.  “Empty”  points  denote  data  whose  existence  has  been  assumed  by  the  agents. 
Based  on  its  own  data,  each  agent  would  form  the  local  interpretations  shown:  agent  A  would  hypothesize 
vehicle  track  Ta  and  agent  would  hypothesize  vehicle  track  Tj.  To  covers  agent  A’s  data  from  times  2  through 
11,  and  T(,  covers  agent  B’s  data  from  times  1  through  10.  These  tracks  are  inconsistent  since  they  imply 
that  either  a  single  vehicle  is  in  different  places  at  the  same  time  or  else  two  vehicles  are  in  the  same  place  at 
the  same  time.  This  inconsistency  cannot  be  immediately  resolved  because  neither  To  nor  Tj,  is  significantly 
more  likely  than  the  other  (each  includes  some  good  quality  data  and  some  poor  quality  data).  The  preferred 
global  interpretation — given  a  complete  view  of  the  data  from  both  agent  A  and  agent  B — is  Ta+b  because 
it  covers  more  high  quality  data  than  either  of  the  local  tracks  (the  remaining  uninterpreted  data  is  due  to 
ghosting  phenomena  and  may  or  may  not  be  explicitly  interpreted  depending  on  the  termination  criteria  of 
the  system).  Ta+b  covers  agent  B’s  data  from  times  1  through  6  and  agent  A’s  data  from  times  5  through 
11  (it  covers  both  agents’  consistent  data  at  times  5  and  6). 

The  scenario  in  Figure  3  is  an  example  of  a  situation  in  which  local  solutions  are 
inconsistent,  and  extended  agent  interactions  are  necessary  to  resolve  the  inconsistency.  In 
the  example,  their  own  local  data  will  cause  agent  A  and  agent  B  to  form  track  hypotheses 
(Ta  and  Tb,  respectively)  that  are  inconsistent  with  each  other.  Because  the  tracks  extend 
through  an  area  of  overlapping  interest,  the  agents  recognize  that  they  can  communciate  to 
try  to  verify  the  global  consistency  of  their  local  interpretations.  Simply  exchanging  the 
partial  solutions — i.e.,  the  track  hypotheses  without  their  supporting  evidential  structures — 
is  sufficient  to  allow  the  inconsistency  to  be  detected,  but  this  level  of  information  is  not 
sufficient  to  allow  the  inconsistency  to  be  resolved.  Resolving  the  inconsistency  in  favor  of 
the  most  likely  global  interpretation  requires  an  understanding  of  the  quality  of  the 
supporting  data  for  the  different  portions  of  each  track  in  order  to  be  able  to  identify  the 
most  likely  overall  interpretation  of  the  data. 
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Obviously,  one  way  to  insure  that  agents  have  the  information  necessary  to  resolve  global 
inconsistencies  would  be  to  always  communicate  the  complete  evidential  information 
associated  with  the  solution  hypotheses.  However,  because  interpretation  hypotheses  are 
complex  structures  that  may  be  interrelated  with  numerous  other  hypotheses, 
communication  and  processing  limitations  typically  make  it  impractical  to  fully 
communicate.  Furthermore,  complete  communication  is  usually  not  necessary — e.g.,  in 
this  example  each  agent  does  not  have  to  know  the  other's  actual  data.  What  is  needed  is  a 
system  with  the  flexibility  to  request  or  respond  with  information  at  different  levels  of 
detail — based  on  the  dynamic  problem-solving  requirements — as  part  of  an  extended 
process  of  resolving  inconsistency.  This  requires  the  ability  to  integrate  incomplete 
information,  represent  the  resulting  uncertainty,  and  use  this  uncertainty  to  drive  further 
actions. 

Our  approach  to  distributed  situation  assessment  is  based  on  the  functionally  accurate, 
cooperative  (FA/C)  paradigm  for  distributed  problem  solving.  In  the  FA/C  paradigm,  an 
agent's  local  problem  solving  is  organized  so  that  partial  and  tentative  results  can  be 
produced  despite  the  agent  having  incomplete  and  uncertain  information.  Inter-agent 
subproblem  interactions  are  viewed  as  constraints  on  the  agents'  local  solutions.  When 
these  partial  results  are  exchanged  among  agents  working  on  interdependent  subproblems, 
these  constraints  can  be  exploited  to  partially  resolve  the  inconsistencies  and  uncertainties 
that  occur  in  local  problem  solving  due  to  the  lack  of  accurate,  complete,  and  up-to-date 
information.  Resolution  can  take  the  form  of  producing  more  complete  partial  results, 
resolving  solution  uncertainty  due  to  competing,  alternative  partial  solutions,  detecting 
inconsistencies  in  previously  generated  results  (either  locally  generated  or  received  from 
other  agents),  and  speeding  up  local  problem  solving  because  the  space  of  possible 
solutions  that  needs  to  be  examined  is  constrained.  The  advantage  of  the  FA/C  approach  is 
that  it  reduces  that  amount  of  information  that  must  be  transferred  among  agents  and 
reduces  delays  that  would  result  from  the  need  for  synchronization  of  agent  problem 
solving. 

As  a  result  of  earlier  research  on  distributed  problem  solving,  we  came  to  understand  that  a 
sophisticated  agent  architecture  is  necessary  to  develop  distributed  real-time  situation 
assessment  systems.  In  this  research,  we  have  been  investigating  the  use  of  the  DRESUN 
system  as  the  basis  of  a  generic  architecture  for  distributed  real-time  situation  assessment. 
DRESUN  agents  maintain  explicit  representations  oflhe  reasons  why  their  interpretations 
are  uncertain  as  well  as  explicit  representations  of  their  goals  and  the  state  of  problem 
solving.  Because  of  this,  the  agents  are  self-aware  and  can  implement  highly  context- 
specific  strategies.  This  makes  it  possible  to  support  the  complex  agent  interaction 
protocols  that  are  necessary  for  multi-sensor  fusion  and  distributed  differential  diagnosis 
strategies.  It  also  allows  agents  to  reason  about  sensor  and  node  errors,  to  dynamically 
adjust  their  problem-solving  strategies,  and  to  base  termination  of  network  problem  solving 
on  realistic  measures  of  the  global  consistency  of  local  solutions.  Our  initial 
experimentation  has  supported  our  belief  the  DRESUN  architecture  provides  the  basis  for 
the  kind  of  flexible  and  reactive  approach  to  the  communication  and  use  of  external 
evidence  that  is  necessary  to  solve  the  sort  of  example  given  above. 

One  reason  why  a  sophisticated  agent  architecture  is  needed  is  the  inherent  complexity  of 
situation  assessment  tasks  such  as  sensor  interpretation.  In  order  to  understand  this  point,  it 
is  useful  to  understand  the  distinction  Clancey  draws  between  classification  problem 
solving  and  constructive  problem  solving,  and  to  understand  the  differences  between 
interpretation  problems  and  the  "diagnosis  problems"  that  are  typically  studied  in  research 


^  The  example  shown  here  is  simplified  to  allow  us  to  focus  on  our  main  points.  It  shows  only  a  small 
fraction  of  the  data  that  would  need  to  be  processed  by  most  real-world  DSA  systems. 
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on  abductive  inference  and  probabilistic  network  inference.  In  classification  problem 
solving,  the  solution  is  selected  from  among  a  pre-enumerated  set  of  all  the  possible 
solutions.  In  constructive  problem  solving,  the  set  of  possible  solutions  is  determined  as 
part  of  the  problem-solving  process.  Diagnosis  problems  can  be  approached  using 
classification  techniques  because  these  problems  are  propositional  (they  have  a  fixed  set  of 
causes  and  possible  findings,  with  fixed  relations  among  them).  By  contrast,  while 
interpretation  problems  have  a  fixed  set  of  top-level  cause  types  and  a  fixed  set  of  data 
types,  they  can  have  an  indeterminate  number  of  instances  of  any  of  the  types.  In 
particular,  there  can  be  an  indeterminate  number  of  instances  of  any  top-level  cause  (e.g., 
vehicles  in  a  vehicle  monitoring  system).  This  leads  to  the  problem  of  correlation  ambiguity 
(it  is  ambiguous/uncertain  which  potential  explanation  hypothesis  a  support  instance  should 
be  associated  with),  which  results  in  a  combinatorial  explosion  of  possible  explanations  for 
a  data  set.  Because  of  the  combinatorics  of  their  answer  spaces,  interpretation  problems 
require  constructive  problem  solving  techniques,  and  this  greatly  complicates  the 
complexity  of  computing  the  most  probable  explanation  (MPE). 

4  General  Intellectual  Issues 

While  our  research  is  being  carried  out  within  the  context  of  distributed  sensor 
interpretation/DSA,  many  of  the  issues  being  addressed  are  applicable  to  other  important 
applications.  For  example,  situation  assessment  is  closely  related  to  diagnosis,  so  we 
expect  that  results  from  our  research  would  apply  to  distributed  diagnosis  in  complex 
networks  such  as  LANs  and  telephone  systems.  Because  our  framework  is  not  limited  to 
classification-type  problem  solving,  it  should  be  able  to  understand  and  assess  more 
complex  situations  than  the  existing  systems  for  these  applications.  For  example,  this 
framework  can  support  the  use  of  active  sensors  that  can  be  controlled/tuned  to  best  cope 
with  the  current  situation  and  provide  data  that  can  resolve  the  most  critical  uncertainties. 
There  is  also  increasing  interest  in  the  development  and  use  of  sophisticated 
automated/autonomous  systems  such  as  robots  and  autonomous  vehicles.  Such  systems 
will  have  to  use  powerful  signal  processing  and  visual  systems,  and  their  capabilities 
should  be  able  to  be  significantly  enhanced  if  they  are  provided  with  the  ability  to  construct 
a  shared  view  of  the  situation. 

Another  example  of  an  increasingly  important  application  where  the  issues  we  are  exploring 
are  applicable  is  cooperative  information  gathering  (CIG).  Interest  in  CIG  is  a  result  of  the 
recent  proliferation  of  electronically  available  information  (e.g.,  via  the  Internet).  As  a 
result,  information  relevant  to  a  user  query  may  be  distributed  over  a  large  set  of  sites,  and 
locating  the  information  that  provides  the  best  response  may  require  a  significant  search 
process.  Clearly,  distributed  search  and  retrieval  has  potential  advantages  in  such  a 
situation.  However,  this  process  can  be  very  complex  since  users  will  have  cost  and  time 
limits  for  answering  queries,  there  will  be  numerous  trade-offs  involving  the  cost  and 
speed  of  accessing  particular  databases,  and  there  may  be  differing  costs/delays  and 
confidence  in  the  quality  of  answers  depending  on  agent/search  configurations.  In  addition 
to  the  applications  discussed  above,  the  research  we  have  performed  addresses  a  range  of 
issues  that  are  of  general  intellectual  interest  to  the  distributed  problem-solving  community: 

•  How  does  the  notion  of  the  bounded  rationality  of  agents  affect  coordination  strategies, 
satisficing  control  strategies,  and  the  quality  of  solutions; 

•  How  do  you  make  use  of  sophisticated  agents  that  have  a  very  wide  range  of  possible 
responses  to  a  situation — e.g.,  many  ways  that  they  could  be  asked  and  that  they  could 
decide  to  respond  to  a  request  for  information; 

•  What  issues  arise  in  reasoning  about  the  termination  of  problem  solving  when  using  a 


8 


satisficing  approach  to  control  under  real-time  constraints; 

•  How  does  one  coordinate  very  dynamic  systems — i.e.,  where  they  can  be  great  variance/ 
uncertainty  in  the  estimates  of  the  quality  and  duration  of  potential  tasks; 

•  What  implications  do  the  use  of  complex  and  dynamically  determined  information  sharing 
protocols  have  for  the  coordination  of  agents?  While  the  Partial  Global  Planning 
mechanism  broke  new  ground  in  coordinating  the  local  problem  solving  of  agents,  the 
agents  it  used  were  very  predictable  since  they  had  simple  and  static  information  sharing 
protocols.  Likewise,  the  Generalized  Partial  Global  Planning  mechanism  has  assumed 
fairly  static  task  structures  and  agents  with  a  limited  range  of  responses. 

•  What  is  the  appropriate  role  for  meta-level  reasoning  for  coordination  in  real-time  situ¬ 
ations; 

•  Can  we  now  fully  realize  the  potential  of  the  Functionally  Accurate/Cooperative  (FA/C) 
distributed  problem  solving  paradigm?  The  FA/C  paradigm  involves  agents  sharing 
information  to  make  up  for  incomplete,  uncertain,  and  incorrect  data.  While  previous 
work  has  showed  that  this  approach  can  be  used  to  deal  successfully  with  incomplete 
data,  little  was  done  with  data  errors  and  other  faults.  This  work  will  allow  us  to  explore 
the  types  of  faults  that  can  be  handled  and  the  cost  involved  (e.g.,  in  terms  of  redundant 
processing). 

•  What  are  the  implications  of  a  distributed  system  being  embeddedwithin  another  system  to 
which  it  must  provide  results?  How  can  fte  goals  of  the  external  system  be  integrated 
with  the  distributed  system  and  how  does  this  affect  coordination  strategies? 

•  How  do  real-time  constraints  affect  cooperation/coordination  strategies  in  distributed 
problem-solving  systems  whose  problems  involve  complex,  interdependent  tasks  (i.e., 
subproblem  interactions).  This  is  not  just  a  matter  of  scheduling  activities  to  meet 
deadlines,  but  includes  satisficing  control  in  which  the  goals  of  the  agents  are  modified. 

•  Can  the  agent  architecture  (laid  out  in  Figure  1)  serve  as  the  basis  of  a  generic  architecture 
for  distribiated  situation  assessment? 

5  Summary  of  Progress  to  Date 

Our  progress  to  this  point  can  be  classified  into  five  main  areas,,  in  which  we  have 

addressed  issues  of  fundamental  importance; 

•  Significant  progress  was  made  in  understanding  how  to  handle  the  transmission  and  use 
of  incomplete  information  and  information  at  different  levels  of  abstraction,  in  DSA 
systems.  This  has  involved  extending  the  agent  belief  modeling  capabilities  of  DRESUN 
agents  in  order  to  facilitate  the  integration  of  such  evidence  and  the  ability  to  locally 
exploit  it.  These  advances  have  resulted  in  additional  flexibility  in  the  local  and  global 
problem-solving  strategies  that  can  be  supported  by  DRESUN  agents. 

•  Though  the  generic  agent  architecture  is  not  complete,  we  believe  that  we  now  have  an 
appropriate  model,  which  simply  needs  some  refinement.  This  model  is  based  on  a 
layered  architecture  (see  Figure  1).  It  has  a  well-defined  interface  between  the  component 
responsible  for  coordination  decisions  and  that  responsible  for  actual  interpretation 
problem  solving.  It  also  has  the  ability  to  handle  real-time  constraints  via  a  scheduling 
module  that  is  relatively  independent,  with  a  well-defined  interface  to  the  other  modules. 
This  agent  model  has  grown  out  of  work  on  the  Generalized  Partial  Global  Planning 
mechanism,  which  uses  the  TAEMS  representation  of  task  structures  and  a  design-to- 
time  real-time  scheduler.  The  design-to-time  scheduling  approach  seems  appropriate  for 
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use  in  DSA  since  it  handles  the  availability  of  multiple  methods  of  varying  quality  to 
accomplish  tasks,  and  the  trade-offs  in  a  satisficing  approach  to  control  under  real-time 
constraints. 

•  We  have  also  expended  a  significant  amount  of  effort  to  improve  the  DRESUN  testbed. 
In  part  this  has  involved  improvments  to  the  implementation  and  its  interface  to  facilitate 
experimentation,  and  porting  the  testbed  it  to  modern  workstation  machines.  It  has  also 
involved  work  to  complete  and/or  extend  the  RESUN/DRESUN  model  to  enhance  and 
explore  its  suitability  as  the  basis  for  a  generic  DSA  architecture.  Particular  areas  of 
development  have  been: 

-  identifying  canonical  evidence  combination  methods  for  the  evidence  summarization 
process; 

-  representing  negative  evidence  uncertainty  in  the  evidence  summaries; 

-  representation  and  summarization  of  approximate  knowledge; 

-  additional  control  plan  sequencing  constructs; 

•  We  have  pursued  several  lines  of  research  involving  the  formal  characterization  and 
analysis  of  DSA  systems.  The  need  for  such  tools  has  become  clear  as  we  have 
addressed  problems  of  increasing  complexity.  Eventually  we  expect  that  these  tools  will 
allow  us  to  formally  analyze  the  performance  of  sophisticated  DSA  systems  in  complex 
environments.  Three  basic  directions  are  being  pursued: 

-  Developed  analysis  tools  for  understanding  coordination  issues  in  DSA  systems; 

-  Developed  an  analysis  and  simulation  framework  for  determining  the  complexity  of 
interpretation  problems; 

-Started  to  formally  analyze  the  solution  quality  of  distributed  interpretation  systems. 

•  We  have  developed  techniques  for  on-line  learning  of  coordination  strategies.  This  has 
been  a  new  area  of  research  that  was  not  anticipated  when  the  work  was  orginally 
proposed.  However,  just  as  the  increasing  problem  and  system  complexity  has 
motivated  use  to  pursue  formal  analysis  techniques,  it  has  also  convinced  us  that  learning 
has  an  important  role  to  play  in  the  development  and  evoluation  of  coordination 
strategies,  system  performance  cannot  be  fully  anticipated. 

More  detailed  descriptions  of  our  research  progress  are  contained  in  the  following  sections 
which  describe  our  results  in  agent  modeling  in  DRESUN;  real-time  coordination;  the 
formal  anlysis  of  coordination,  system  solution  quality,  and  problem  complexity;  and  on¬ 
line  learning  of  coordination  strategies. 

6  Agent  Modeling  in  DRESUN 

One  of  the  major  areas  of  effort  in  developing  the  DRESUN  testbed  has  been  to  address 
issues  related  to  modeling  the  beliefs  of  other  agents.  Modeling  is  important  because  DSA 
agents  typically  must  share  information  in  order  to  satisfy  their  local  goals  as  well  as  the 
overall  system  goals  (since  agent  subproblems  are  interdependent).  DSA  tasks  can  present 
several  sources  of  difficulty  for  information  sharing:  agents'  local  evidence  may  lead  to 
solutions  that  are  globally  inconsistent;  agent  beliefs  (interpretations  of  local  data)  are 
uncertain  and  imprecise;  interpretations  are  complex  structures;  and  beliefs  are  constantly 
being  revised  due  to  new  data  and  further  processing.  When  an  agent  shares  information 
about  its  interpretations  with  another  agent,  that  information  necessarily  produces  evidence 
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in  a  sensor  interpretation  framework  as  an  integral  part  of  the  process  of  using  the 
information — e.g.,  checking  whether  the  information  is  consistent  or  inconsistent  with  the 
local  interpretations.  Interpretation  evidence  based  on  information  shared  by  another  agent 
is  referred  to  as  external  evidence. 

Our  initial  experiments  with  DRESUN  showed  that  extensions  to  the  model  of  external 
evidence  were  necessary  to  effectively  utilize  inter-agent  communication  of  incomplete  and 
conflicting  evidence,  and  evidence  at  multiple  levels  of  abstraction.  The  focus  of  these 
extensions  has  been:  representing  the  uncertainties  that  occur  when  DRESUN  agents 
exchange  such  information,  providing  the  ability  to  reformulate  hypotheses  to  efficiently 
pursue  alternative  interpretations,  and  allowing  numeric  evaluation  of  the  satisfaction  of 
global  termination  criteria.  The  extensions  enhance  the  flexibility  of  the  agents  by 
improving  their  ability  to  evaluate  the  current  state  of  agent  beliefs,  make  better  (local)  use 
of  incomplete  information  from  another  agent,  and  determine  precisely  what  information  is 
needed  to  resolve  global  inconsistencies.  This  effort  turned  out  to  be  much  more  involved 
than  we  had  orginally  forseen. 


Among  the  key  modeling  issues  that  we  have  addressed,  we  have  provided  the  ability  to: 

•  link  multiple  views  of  a  hypothesis  based  on  external  evidence  at  different  levels  of 
abstraction; 

•  link  multiple  hypothesis  extensions  that  are  being  used  in  alternative  local 
interpretations  (e.g.,  different  portions  of  a  single  external  vehicle  track  hypothesis); 

•  create  alternative  local  versions  of  external  hypotheses  based  on  incomplete  information 
and  represent  the  resulting  uncertainties; 

•  reformulate  hypotheses  for  more  efficient  exploration  and  representation  of  alternatives; 

•  communicate  back  results  of  integrating  information  that  was  sent  by  another  agent; 

•  avoid  circular  reasoning  when  exchanging  evidence  among  agents; 

•  identify  when  shared  information  should  be  updated. 

Appendix  A  contains  further  elaboration  of  these  ideas. 

7  Real-Time  Coordination  •  ’ 


q:  63  q:  126  q;  8  q;  16 


Figure  4:  An  example  task  structure  with  2  task  groups. 

The  black  lines  indicate  subtask  relationships.  The  thick  dark  grey  line  is  an  enables  constraint.  The 
thick  light  grey  lines  arc  facilitates  constraints,  which  represent  that  there  are  other  tasks  whose  results  will 
improve  the  quality  of  a  task’s  result  and  or  the  time  to  produce  the  result. 
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The  kinds  of  complexities  that  appear  in  real-time  interpretation  problems  have  motivated  a 
new  framework  for  controlling  real-time  problem  solving  that  we  call  design-to-time  real¬ 
time  scheduling.  In  this  framework  multiple  methods  are  available  for  many  tasks  that 
allow  computation  time  and  value  (precision,  Completeness,  and  certainty)  to  be  traded-off 
to  achieve  acceptable  results  even  with  tight  deadlines  for  tasks.  Design-to-time  can  be 
contrasted  with  the  anytime  algorithm  approach  where  instead  of  a  set  of  methods  for 
completing  a  task  there  is  a  single  anytime  algorithm  that  achieves  increased  value  as  it  is 
given  increased  computation  tim.e.  We  have  found  that  multiple  methods  better  model  the 
kinds  of  tasks  that  are  likely  to  appear  in  sophisticated  interpretation  systems.  An  example 
of  a  task  structure  with  multiple  methods  is  given  in  Figure  4.  A  schedule  for  this  example 
problem  is  given  in  Figure  5. 


m2: 

ml3: 

mil: 

m3: 

q:  243.  d:  4 

q:  6,  d:  2 

q:  3,  d:  3 

q:  63,  d:  6 

0  2  4  6  8  10  12  14  16 

Figure  5:  The  best  schedule  for  the  task  structure  given  in  Figure  1. 

Note  that  mil  and  m3  are  both  facilitated,  resulting  in  higher  quality  and  (in  the  case  of  m3)  lower  duration 
values  than  given  in  their  initial  specification.  The  total  quality  of  this  schedule  is  306. 


One  major  focus  of  the  design-to-time  work  has  been  on  taking  interactions  among  tasks 
into  consideration  when  scheduling.  For  example.  Task  A  might  facilitate  Task  B  if  Task 
A  performs  some  work  that  allows  Task  B  to  be  performed  more  quickly  or  with  higher 
value.  Because  the  design-to-time  scheduling  problem  is  NP-Hard  (in  fact,  of  complexity 
0(m!)  where  m  is  the  number  of  executable  methods)  most  of  our  work  has  used  heuristic 
scheduling  algorithms. 

Recent  design-to-time  work  has  focused  on  negotiation  in  a  multi-agent  problem  solver 
with  design-to-time  scheduling  agents.  This  work  has  been  part  of  the  .development  of  our 
layered  agent  architecture  (see  Figure  1).  We  have  focused  on  the  interface  between  design- 
to-time  agents  and  local  problem  solvers  that  communicate  and  coordinate  with  other 
problem  solvers.  One  important  part  of  this  work  is  the  use  of  commitments  where  one 
agent  commits  to  providing  a  particular  result  by  a  particular  time  to  assist  another  agent. 
Schedulers  need  to  balance  local  problem  solving  concerns  with  concerns  about  providing 
timely  results  to  other  agents.  Appendix  B  contains  further  elaboration  of  these  ideas. 

8  Formal  Analysis  of  Coordination 

A  key  issue  in  the  design  of  distributed  sensor  networks  is  the  organization  of  the  agents: 
the  extent  of  each  agent's  area  of  responsibility  and  the  amount  of  overlap  among  these 
areas.  Overlap  is  an  important  issue  because  while  it  allows  for  redundancy  that  can  make 
up  for  poor  quality  data  and  sensor/node  failure,  processing  can  become  very  inefficient  if 
it  is  not  limited.  One  aspect  of  our  formal  work  has  been  to  understand  the  effects  of 
different  organizations  and  coordination  approaches. 

Our  analysis  initially  focused  on  exactly  what  a  priori  knowledge  agents  had  about  the 
distribution  of  task  groups  in  an  episode.  First  we  looked  at  the  distribution  of  the  lowest- 
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level  sensor  subtasks  of  a  single  task  group  among  multiple  agents  (deriving  the  maximum 
expected  number  of  subtasks),  and  then  we  looked  at  the  distribution  of  task  groups 
themselves.  These  results  were  then  used  to  derive  the  total  amount  of  work,  and  therefore 


Figure  6:  Objective  task  structure  associated  with  a  single  vehicle  track. 

expected  termination  performance,  under  various  organizational  structures  and  control 
schemes.  The  derivation  was  as  follows:  starting  with  the  expected  number  of  low-level 
sensor  subtasks  at  the  most  heavily-loaded  agent  we  derived  the  number  of  task  groups  at 
the  most  heavily-loaded  agent  given  the  number  of  agents  that  see  a  single  task  group  and 
the  environmental  parameters.  We  then  derived  the  number  of  agents  that  will  see  a  single 
task  group,  and  the  structure  of  a  task  group  (Figure  6),  which  allows  us  to  derive  the 
amount  of  work  an  agent  must  do  given  the  number  of  sensor  subtasks  it  executes,  the 
number  of  task  groups  at  the  agent,  and  the  number  of  agents  involved  in  a  single  task 
group.  All  of  these  parts  together  allow  us  to  derive  the  expected  termination  time  of  a 
particular  system  of  agents. 


Figure  7:  On  the  left  is  a  3x3  static  organization,  on  the  right  is  the  dynamic  reorganization 
result  after  agents  3,  4,  5  and  7  attempt  to  reduce  their  areas  of  responsibility  by  one  unit. 
In  this  example  the  corridors  running  North  to  South  have  been  moved  closer  by  two  units 
to  reduce  the  load  on  agents  4,  5,  and  6  in  the  second  column. 
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Our  particular  implementation  of  the  general  idea  of  dynamically  reorganizing  the  partitions 
between  agents  for  the  DSN  simulation  keeps  each  agent's  area  of  responsibility 
rectangular,  and  relaxes  competing  constraints  from  other  agents  quickly  and  associatively 
(the  order  of  message  arrival  does  not  affect  the  eventual  outcome).  To  do  this,  the 
message  sent  by  an  agent  requests  the  movement  of  the  four  corridors  surrounding  an 
agent.  The  northern  corridor  of  Agent  1,  for  example,  is  the  northern  agent  organizational 
responsibility  boundary  shared  by  every  agent  in  the  same  row  as  Agent  1.  As  can  be  seen 
in  Figure  7,  a  3x3  organization  has  four  corridors  (between  rows  1  and  2,  2  and  3,  and 
between  columns  1  and  2,  2  and  3). 

This  work  has  been  used  to  analyze  static  and  dynamic  organizational  structures  for 
distributed  sensor  network  tasks.  We  first  showed  how  the  performance  of  any  static 
organization  can  be  statistically  described,  and  then  showed  under  what  conditions  dynamic 
organizations  do  better  and  worse  than  static  ones.  Finally,  we  showed  how  the  variance 
in  the  agents'  performance  leads  to  uncertainty  about  whether  a  dynamic  organization  will 
perform  better  than  a  static  one  given  only  agent  a  priori  expectations.  In  these  cases,  we 
showed  when  meta-level  communication  about  the  actual  state  of  problem  solving  will  be 
useful  to  agents  in  constructing  a  dynamic  organizational  structure  that  outperforms  a  static 
one.  Viewed  in  its  entirety,  our  work  also  presented  a  methodology  for  answering 
questions  about  the  design  of  distributed  problem  solving  systems  by  analysis  and 
simulation  of  the  characteristics  of  a  complex  environment  rather  than  by  relying  on  single¬ 
instance  examples.  Appendix  C  further  elaborates  on  these  ideas. 

9  Formal  Analysis  of  DSA  Solution  Quality 

While  several  systems  that  use  the  FA/C  approach  have  been  built,  there  has  never  been 
any  formal  analysis  of  the  quality  of  the  solutions  that  are  produced  by  the  approach  or  of 
the  conditions  that  are  necessary  for  the  approach  to  be  successful.  We  have  recently  taken 
a  first  step  toward  being  able  to  formally  analyzing  the  quality  of  solutions  that  can  be 
produced  by  an  FA/C  system.  Two  theorems  have  been  proven  that  compare  the  quality  of 
solutions  produced  by  a  distributed  system  to  the  solutions  that  would  be  produced  by  an 
equivalent  centralized  system.  The  theorems  relate  solution  quality  to  agent  problem¬ 
solving  and  coordination  strategies.  They  show  that  there  are  conditions  under  which  it  is 
possible  to  guarantee  that  the  distributed  system  produces  a  solution  that  is  comparable  to 
the  centralized  solution,  and  other  conditions  under  which  there  is  merely  some  probability 
of  obtaining  such  a  solution.  The  analysis  assumes  the  capabilities  of  an  abstract  model  of 
the  DRESUN  system  for  distributed  sensor  interpretation. 

This  work  is  different  from  most  DAI  work  dealing  with  global  consistency  of  agent 
beliefs,  which  has  focused  on  methods  for  automatically  maintaining  (some  level  of) 
consistency  and  has  used  justification-based  representations  of  belief  (e.g.,  TMSs).  Here, 
we  are  interested  in  analyzing  the  consequences  of  not  maintaining  consistency  (even 
within  individual  agents)  in  terms  of  its  effect  on  solution  quality.  This  is  an  important 
issue  for  problems  like  distributed  sensor  interpretation  where  communication  and 
computation  considerations  often  make  it  impractical  to  maintain  complete  consistency  (and 
where  consistency  is  not  always  necessary  for  determining  an  acceptable  solution). 

We  believe  that  this  line  of  research  will  help  our  understanding  of  the  assumptions  that 
underlie  the  FA/C  model  of  distributed  problem  solving  and  that  it  will  assist  in  producing 
appropriate  propagation  and  coordination  strategies  for  distributed  interpretation  systems.  It 
should  also  make  it  possible  to  analyze  FA/C  frameworks  like  DRESUN  to  determine 
whether  the  framework  is  effective  in  terms  of  making  it  possible  to  use  all  of  the 
information  that  the  (global)  system  has  available  or  not  (and  what  the  effect  there  is  in  not 
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being  able  to  use  all  of  this  information).  We  are  currently  working  to  extend  our  analyses 
in  several  directions:  deriving  specific  probabilities  of  interest  in  connection  with  the 
theorems  as  a  function  of  domain  models  and  acceptance  thresholds;  understanding  how  to 
formalize  the  effects  of  different  evidence  propagation  and  coordination  strategies; 
extending  the  analyses  to  deal  with  processing  of  only  subsets  of  the  available  data;  and 
generalizing  to  CDPS  tasks  other  than  sensor  interpretation.  Appendix  D  contains  further 
elaboration  of  these  ideas. 

10  Analysis  of  the  Complexity  of  Interpretation  Problems 

The  IDP/UPC  formalism  was  developed  for  analyzing  the  relationship  between  the 
performance  of  search-based  interpretation  problem  solving  systems  and  the  inherent 
properties,  or  structure,  of  problem  domains  in  which  they  are  applied.  The  formalism 
models  interpretation  domain  theories  (i.e.,  the  computational  theory  that  is  the  basis  for  a 
problem  solver's  functionality)  in  terms  of  four  feature  structures:  component  (or  syntax), 
utility  (or  credibility),  probability  (or  distribution),  and  cost.  The  different  feature  structures 
are  represented  in  terms  of  a  domain  grammar  and  functions  associated  with  production 
rules  of  the  grammar.  Search  paths  are  represented  graphically  as  interpretation  trees  of  the 
grammar.  By  analyzing  the  statistical  properties  of  the  interpretation  trees  of  a  domain 
grammar,  it  is  possible  to  determine  characteristics  of  problem  solving  such  as  the  expected 
cost  for  a  random  problem  instance  and  the  expected  credibility  of  a  solution. 

Figure  8  shows  an  example  of  a  domain  grammar  used  for  analyzing  sensor  interpretation 
tasks  in  a  vehicle  tracking  domain.  Figure  9  presents  an  example  problem  instance 
generated  with  the  domain  grammar. 

The  most  important  aspect  of  the  IDP/UPC  formalism  is  that  it  supports  a  unified 
representation  of  both  meta-operators  and  domain  processing.  This  unified  representation  is 
based  on  viewing  sophisticated  control  architectures  as  mechanisms  that  evaluate  problem 
solving  actions  by  selectively  applying  a  process  that  examines  a  search  path's  relationship 
with  other,  possibly  interacting,  search  paths.  (Relationships  are  based  on  the  distribution 
of  domain  events  and  are  statistical  in  nature.  For  example,  a  relationship  might  indicate 
that  two  search  paths  lead  to  the  same  final  result  50%  of  the  time.)  Furthermore,  the 
process  of  examining  a  search  path's  relationships  are.yiewed  as  a  distinct  search  operation 
and  not  as  part  of  the  control  architecture.  Interrelationships  between  search  paths  are 
represented  as  abstract  or  approximate  states  that  are  explicitly  created  by  search  operators 
and  not  by  a  monolithic  control  process. 

The  IDP/UPC  formalism  was  developed  to  analyze  the  assumptions  (implicit  and  explicit) 
that  meta-operators  make  about  a  domain.  This  work  represents  an  initial  step  needed  to 
formalize  complex  distributed  search  processes  that  occur  in  DSA  systems.  Appendix  E 
further  elaborates  on  these  ideas. 

11  On-Line  Learning  of  Coordination  Strategies 

Although  coordination  is  an  essential  technique  for  the  design  of  DSA  systems, 
sophisticated  coordination  strategies  are  not  always  cost-effective  in  all  problem-solving 
situations,  nor  is  it  always  possible  to  understand  even  after  the  DSA  system  is  operational 
all  the  task  and  environmental  variables  that  come  into  play  in  deciding  what  form  of 
coordination  at  any  specific  instance  is  most  appropriate.  TTius,  we  have  been  investigating 
an  on-line  learning  method  to  acquire  coordination  plans  for  specific  problem-solving 
situations  so  that  the  appropriate  type  of  coordination  strategy  is  used.  This  effort  was  not 
anticipated  in  the  original  proposed  scope  of  work  but  as  we  have  come  to  understand  the 
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•  :  signal  data  from  sensor.  Each  point  has  a 
time,  location,  energy  level,  and  frequency. 


:  the  actual  path  of  the  vehicle  that  generated  the  data. 
:  a  ghost  image  of  the  actual  path. 


Figure  9:  Problem  Instance  Generated  with  Vehicle  Tracking  Grammar 


design  requirements  of  a  DSA  system  we  realized  the  key  role  such  technology  could  play. 
This  learning  is  accomplished  by  recording  and  analyzing  traces  of  inferences  after  problem 
solving.  The  analysis  identifies  situations  where  inappropriate  coordination  strategies  have 
resulted  in  redundant  activities  or  the  lack  of  timely  execution  of  important  activities,  thus 
degrading  system  performance.  The  analysis  is  also  used  to  create  situation-specific 
coordination  plans  that  use  additional  non-local  information  about  activities  in  the 
networks,  which  are  added  to  the  system  to  remedy  the  problem.  These  situation-specific 
plans  can  decide  (1)  priorities  of  local  and  non-Iocak  activities,  (2)  priorities  of  messages, 
and  (3)  what  non-local  information  is  needed  for  coordination  activitifes.  These  plans  also 
have  the  effect  of  introducing  different  levels  of  coordination  into  this  system,  that  is,  the 
system  can  decide  when  tight  coordination  is  required  and  when  autonomous  processing  is 
more  appropriate.  This  project  investigates  this  approach  using  a  distributed  computer- 
network  monitoring  and  diagnosis  system.  Appendix  F  contains  further  elaboration  of 
these  ideas. 

In  summary,  we  feel  that  results  of  the  research  supported  under  this  contract  have 
contributed  significantly  to  the  eventual  construction  of  practical  Distributed  Situation 
Assessment  systems. 
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13  TECHNOLOGY  TRANSFER 

A  distributed  local-area  network  diagnosis  system  has  been  developed  by  NTT  that 

incorporates  the  techniques  for  on-line  learning  of  coordination  strategies  developed  under 

this  contract. 

The  RESUN  architecture  has  been  reimplemented  in  C++  by  Professor  Hamid  Nawab  of 

Boston  University  to  be  used  as  part  of  a  radar  signal  processing  testbed. 

The  RESUN  interpretation  system  is  being  used  by  Professor  Paul  Cohen  of  University  of 

Massachusetts  in  his  development  of  an  intelligent  assistant  for  data  analysis. 
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Abstract 

This  paper  reports  on  extensions  that  have  been  made 
to  the  DRESUN  testbed  for  research  on  distributed 
situation  assessment  (DSA).  These  extensions  involve 
issues  that  have  arisen  in  modeling  the  beliefs  of  other 
agents  when  dealing  with  inter-agent  communication 
of  incomplete  and  conflicting  evidence,  and  evidence 
at  multiple  levels  of  abstraction.  The  extensions  sup¬ 
port  highly  directed  exchanges  of  evidence  among 
agents  because  they  better  represent  the  uncertainties 
that  occur  when  DRESUN  agents  exchange  incom¬ 
plete  and  conflicting  information.  This  is  important 
in  FA/C  systems  because  agents  must  share  results  in 
order  to  satisfy  their  local  goals  as  well  as  the  overall 
system  goals.  Thus,  sharing  must  be  done  efficiently 
for  an  FA/C  approach  to  be  effective.  These  issues 
will  arise  in  any  distributed  problem  solving  applica¬ 
tion  involving  interacting  subproblems,  when  agents 
must  function  without  complete  and  up-to-date  infor¬ 
mation. 

Introduction 

The  functionally  accurate^  cooperative  (FA/C) 
paradigm  for  distributed  problem  solving  (Lesser  & 
Corkill  1981;  Lesser  1991)  was  proposed  for  applica¬ 
tions  in  which  tasks  are  naturally  distributed  but  in 
which  the  distributed  subproblems  are  not  indepen¬ 
dently  solvable.  In  the  FA/C  approach,  agents  pro¬ 
duce  tentative,  partial  results  based  on  local  informa¬ 
tion  and  then  exchange  these  results  with  the  other 
agents.  The  constraints  that  exist  among  the  agents’ 
subproblems  are  then  exploited  to  resolve  the  local  un¬ 
certainties  and  global  inconsistencies  that  occur  due  to 
the  lack  of  accurate,  complete,  and  up-to-date  local  in¬ 
formation. 

In  (Carver  &  Lesser  1991b)  we  described  the  capa¬ 
bilities  of  the  initial  implementation  of  DRESUN,  a 
testbed  for  research  on  distributed  situation  assess- 

*This  work  was  supported  by  the  Department  of  the 
Navy,  Office  of  the  Chief  of  Naval  Research,  under  contract 
N00014-92-J-1450.  The  content  of  the  information  does  not 
necessarily  reflect  the  position  or  the  policy  of  the  Govern¬ 
ment,  and  no  official  endorsement  should  be  inferred. 
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ment  (DSA)^  using  an  FA/C  approach.  DRESUN 
was  developed  to  explore  the  implications  of  having 
agents  with  more  sophisticated  evidential  representa¬ 
tions  and  control  capabilities  than  the  agents  that  were 
used  in  earlier  research  with  the  Distributed  Vehicle 
Monitoring  Testbed  (DVMT)  (Lesser  &  Corkill  1983; 
Durfee  &  Lesser  1991).  Because  of  agent  limitations, 
that  research  did  not  adequately  address  several  im¬ 
portant  issues  that  arise  when  sharing  incomplete  and 
inconsistent  information  among  DSA  agents.  Further¬ 
more,  overall  agent  activities  were  not  driven  by  an  ex¬ 
plicit  need  to  produce  local  solutions  that  were  globally 
consistent  (let  alone  globally  optimal). 

This  paper  reports  on  extensions  to  the  ini¬ 
tial  DRESUN  testbed,  related  to  modeling  the  be¬ 
liefs/evidence  of  other  agents.  The  basic  DRESUN 
architecture  provides  a  good  basis  for  an  FA/C  ap¬ 
proach  because  inter-agent  subproblem  interactions  are 
explicitly  represented  and  used  to  drive  problem  solv¬ 
ing.  However,  our  experiments  showed  that  extensions 
to  the  model  of  external  evidence^  were  necessary  to 
make  the  most  effective  use  of  inter-agent  communi¬ 
cation  of  incomplete  and  conflicting  evidence,  and  ev¬ 
idence  at  multiple  levels  of  abstraction.  The  focus  of 
these  extensions  has  been  on  representing  the  uncer¬ 
tainties  that  occur  when  DRESUN  agents  exchange 
such  information,  determining  precisely  what  informa¬ 
tion  is  needed  to  resolve  global  inconsistencies,  and 
providing  the  ability  to  reformulate  hypotheses  to  effi¬ 
ciently  pursue  alternative  interpretations. 

These  issues  are  very  important  in  FA/C  systems 
because  agents  must  share  results  in  order  to  satisfy 
their  local  goals  as  well  as  the  overall  system  goals. 


^Situation  assessment  involves  the  fusion  of  sensor  data, 
intelligence  information,  and  so  forth,  and  the  interpreta¬ 
tion  of  this  information  to  produce  a  high-level  description 
of  the  situation  in  the  environment. 

^In  a  DSA  framework,  results  from  another  agent  neces¬ 
sarily  produces  evidence  as  an  integral  part  of  the  process 
of  using  the  information — e.g.,  checking  whether  the  infor¬ 
mation  is  consistent  or  inconsistent  with  the  local  interpre¬ 
tations.  Evidence  based  on  information  shared  by  another 
agent  is  referred  to  as  external  evidence. 


To  resolve  data  uncertainties^  an  agent  must  be  able 
to  evaluate  whether  other  agents’  results  are  consis¬ 
tent  or  inconsistent  with  its  own  results,  and  inte¬ 
grate  these  other  agents’  results  to  revise  and  extend 
its  local  partial  results  (Lesser  &  Corkill  1981).  A 
key  assumption  of  the  FA/C  approach  is  that  agents 
can  do  this  without  “excessive”  communication  among 
the  agents.^  However,  DSA  tasks  can  present  several 
sources  of  difficulty  for  efficient  results  sharing:  agents’ 
local  data  may  lead  to  solutions  that  are  globally  incon¬ 
sistent;  agent  beliefs/results  are  uncertain  and  impre¬ 
cise;  results  (e.g.,  interpretations)  are  complex  struc¬ 
tures;  and  beliefs  are  constantly  being  revised  due  to 
new  data  and  further  processing. 

The  scenario  in  Figure  1  is  an  example  of  a  dis¬ 
tributed  vehicle  monitoring  situation  in  which  local 
solutions  are  inconsistent  and  extended  agent  interac¬ 
tions  are  necessary  to  resolve  the  inconsistency.  We 
will  use  this  example  to  introduce  some  key  issues  in  re¬ 
sults  sharing,  and  we  will  return  to  it  in  more  detail  in 
the  Resolving  Global  Inconsistency  Section,  to  explore 
the  representation  of  external  evidence  in  D RESUN. 

In  the  example,  processing  of  only  their  own  local 
data  would  cause  agent  A  and  agent  B  to  form  the 
track  hypotheses  Ta  and  T^,  respectively.  Because  the 
tracks  extend  through  an  area  of  overlapping  interest, 
the  agents  can  recognize  that  they  must  communicate 
to  verify  the  global  consistency  of  their  local  interpre¬ 
tations.  These  tracks  are  inconsistent  since  they  imply 
that  either  a  single  vehicle  is  in  different  places  at  the 
same  time  or  else  two  vehicles  are  in  the  same  place 
at  the  same  time.  One  important  thing  to  note  here 
is  that  while  exchanging  abstract  results  (the  track 
hypotheses  without  their  supporting  evidential  struc¬ 
tures)  allows  the  inconsistency  to  be  detected,  this  level 
of  information  is  not  sufficient  to  allow  the  inconsis¬ 
tency  to  be  resolved  (i.e.,  there  remains  uncertainty 
about  the  correct  global  interpretation).  This  is  be¬ 
cause  neither  Ta  nor  Tf,  is  significantly  more  likely  than 
the  other  since  each  includes  some  good  quality  data 
and  some  poor  quality  data.  (Even  when  partial  re¬ 
sults  are  “consistent,’'  uncertainties  may  result  when 

^Another  current  research  direction  of  ours  is  formal 
analysis  of  the  FA/C  model  in  terms  of  the  domain  char¬ 
acteristics  necessary  for  the  approach  to  be  effective  and 
the  quality  of  solutions  that  can  be  produced — see  (Carver 
&  Lesser  1994;  Carver  1995).  For  instance,  effective  FA/C 
problem  solving  requires  one  or  more  of  the  following:  1.) 
only  a  subset  of  each  agent’s  subproblems  interact  with 
those  of  other  agents;  2.)  it  can  be  determined  what  lo¬ 
cal  data/abstractions  are  relevant  to  which  other  agents; 
3.)  data  abstractions  (i.e.,  the  tentative,  partial  results) 
can  substitute  for  the  raw  data  in  determining  the  global 
consistency  and  likelihood  of  local  solutions.  If  these  con¬ 
ditions  are  not  satisfied  then  a  centralized  approach  may 
have  better  performance  (though  a  distributed  approach 
may  still  be  preferred  due  to  factors  like  tighter  coupling  of 
the  processor  and  sensor,  or  increased  reliability  and  grace¬ 
ful  performance  degradation). 


only  abstractions  are  exchanged — see  the  Representing 
External  Evidence  Section.) 

Resolving  the  inconsistency  in  favor  of  the  most 
likely  global  interpretation  requires  an  understanding 
of  the  level  of  belief  provided  by  data  supporting  the 
different  portions  of  each  track.  In  other  words,  more 
detailed  information  about  the  interpretation  hypothe¬ 
ses  is  required.  One  way  to  insure  that  agents  have 
all  the  necessary  information  would  be  to  always  com¬ 
municate  the  complete  evidential  structure  associated 
with  the  hypotheses.  However,  because  this  structure 
can  be  very  complex,  communication  and  processing 
limitations  will  typically  make  it  impractical  to  fully 
communicate  this  information."^  Furthermore,  com¬ 
plete  communication  is  often  not  necessary.  For  in¬ 
stance,  we  will  see  that  in  this  example  each  agent 
does  not  have  to  have  complete,  detailed  information 
about  each  other’s  raw  data  in  order  to  resolve  the 
inconsistency. 

Thus,  what  is  needed  is  a  system  with  the  flexibil¬ 
ity  to  request  or  respond  with  information  at  different 
levels  of  detail — based  on  the  dynamic  problem-solving 
requirements — as  part  of  an  incremental  process  of  re¬ 
solving  inconsistency.  This  requires  the  ability  to  in¬ 
tegrate  incomplete  results  information,  represent  the 
resulting  uncertainty,  and  use  this  representation  to 
drive  further  actions.  The  DRESUN  architecture  pro¬ 
vides  these  capabilities. 

Because  of  its  representation  of  inconsistency  as  a 
source  of  uncertainty  and  its  emphasis  on  directed  in¬ 
teractions  to  resolve  inconsistency,  DRESUN  differs 
from  most  DAI  work  dealing  with  the  global  consis¬ 
tency  of  local  agent  beliefs.  This  work  (e.g.,  (Courand 
1990;  Huhns  k  Bridgeland  1991))  has  largely  focused 
on  methods  for  automatically  maintaining  (some  par¬ 
ticular  level  of)  consistency  and  has  used  justification- 
based  representations  of  belief  (e.g.,  TMSs).  DRESUN 
does  not  automatically  enforce  complete  consistency 
because  this  can  be  very  expensive  both  in  terms  of 
communication  and  computation,  and  it  is  usually  not 
necessary.  DRESUN  uses  an  evidential  (partial  beliefs) 
representation  rather  than  a  justification-based  repre¬ 
sentation  of  its  beliefs  because  virtually  all  evidence 
and  hypotheses  in  a  DSA  system  are  uncertain. 

The  next  section  reviews  the  DRESUN  architecture, 
in  which  agent  interactions  are  driven  by  the  need  to  re¬ 
solve  uncertainty  about  the  global  consistency  of  local 
solutions.  This  is  followed  by  a  section  that  examines 
some  of  the  issues  that  have  arisen  in  represent ating  ex¬ 
ternal  evidence  in  DRESUN.  The  example  introduced 
in  this  section  is  then  explored  in  more  detail,  and  the 
paper  concludes  with  a  brief  summary  and  a  discussion 

^The  example  shown  here  is  simplified  to  allow  us  to 
focus  on  our  main  points.  It  shows  only  a  small  fraction 
of  the  data  that  would  be  faced  by  most  real-world  DSA 
systems,  and  it  does  not  show  the  numerous  alternative 
interpretation  hypotheses  that  would  be  interrelated  with 
any  solution  hypotheses. 


The  application  is  vehicle  monitoring.  Agent  A  and  Agent  B  receive  data  only  from  their  own  individual  sensors,  whose 
coverage  regions  overlap.  Agent  A’s  data  is  represented  by  squares  and  agent  B’s  by  circles,  with  positions  as  indicated  at 
the  times  denoted  by  the  associated  numbers.  The  grey  density  of  the  data  points  corresponds  to  the  relative  “quality”  of 
the  data — i.e.,  the  a  priori  likelihood  that  the  data  would  have  resulted  from  a  real  vehicle,  and  the  spectral  content  of  the 
acoustic  signals,  “Empty”  points  denote  data  whose  existence  has  been  assumed  by  the  agents.  Based  on  its  own  data,  each 
agent  would  form  the  local  interpretations  shown:  agent  A  would  hypothesize  vehicle  track  To  and  agent  would  hypothesize 
vehicle  track  Tb.  Ta  covers  agent  A’s  data  from  times  2  through  11,  and  Tb  covers  agent  B’s  data  from  times  1  through  10. 
The  preferred  global  interpretation — given  a  complete  view  of  the  data  from  both  agent  A  and  agent  B — is  To+b  because  it 
covers  more  high  quahty  data  than  either  of  the  local  tracks  (the  remaining  uninterpreted  data  is  due  to  ghosting  phenomena 
and  may  or  may  not  be  explicitly  interpreted  depending  on  the  termination  criteria  of  the  system).  Ta+6  covers  agent  B’s 
data  from  times  1  through  6  and  agent  A’s  data  from  times  5  through  11  (it  covers  both  agents’  consistent  data  at  times  5 
and  6). 


of  current  research  issues. 

DRESUN 

DRESUN  agents  are  RESUN  interpretation  sys¬ 
tems  (Carver  Lesser  1991a;  Carver  &  Lesser  1993). 
One  of  the  key  ideas  in  RESUN  is  the  use  of  sym¬ 
bolic  source  of  uncertainty  statements  (SOUs)  in  the 
evidence  for  the  interpretation  hypotheses.  The  SOUs 
allow  agents  to  understand  the  reasons  why  their  hy¬ 
potheses  are  uncertain  and  why  their  termination  cri¬ 
teria  remain  unmet.  RESUN  also  supports  the  use  of 
satisficing  control  and  heuristically  controlled,  approx¬ 
imate  evaluation  of  hypothesis  belief  to  deal  with  the 
computational  complexity  of  DSA  problems  (Carver  & 
Lesser  1994). 

In  an  FA/C  system,  there  must  be  some  mechanism 
to  drive  the  exchange  of  results  among  the  agents  so 
that  incorrect  and  inconsistent  local  solutions  can  be 
detected  and  dealt  with.  Ideally,  this  would  be  ac¬ 
complished  with  a  mechanism  that  allows  agents  to 
understand  where  there  are  constraints  among  their 
subproblems,  so  that  information  interchange  could 
be  highly  directed.  DRESUN  provides  just  this  ca¬ 
pability  for  DSA  applications.  DRESUN  agents  create 
global  consistency  SOUs  whenever  it  is  determined  that 
a  local  hypothesis  can  obtain  evidence  from  another 
agent — i.e.,  whenever  a  subproblem  interaction  (con¬ 


straint)  is  detected.  These  SOUs  are  viewed  as  sources 
of  uncertainty  about  the  correctness  of  an  agent’s  local 
solution  because  they  represent  unresolved  questions 
about  the  global  consistency  of  the  solution.  Exam¬ 
ples  of  situations  involving  each  of  the  global  consis¬ 
tency  SOUs  are  shown  in  Figure  2. 

DRESUN’s  global  consistency  SOUs  make  explicit 
the  possible  interrelationships  between  agents’  local 
subproblems,  and  provide  an  integrated  view  (in  con¬ 
junction  with  the  standard  RESUN  SOUs)  of  both  the 
local  and  global  problem-solving  goals,  which  drive 
agent  control  decisions.  Thus  in  DRESUN,  agents 
exchange  results  based  on  the  goal  of  insuring  the 
global  consistency  of  their  local  solutions.  So  far, 
though,  we  have  not  discussed  what  “resolving”  a 
global  SOU  means.  Resolution  of  a  global  SOU  in¬ 
volves  exchanging  information  among  the  associated 
agents  so  as  to  effectively  propagate  evidence  between 
their  hypothesis-evidence  networks,^  An  example  of 
the  resolution  of  a  global  SOU  is  shown  in  Figure  3. 
Resolution  of  global  SOUs  is  analogous  to  (intra-agent) 
evidence  propagation,  and  as  with  evidence  propaga¬ 
tion  there  are  a  range  of  strategies  that  may  be  used 
to  determine  which  global  SOUs  to  pursue  and  how 

^ While  there  are  significant  differences  between  these 
networks  and  Bayesian  or  belief  nets,  for  our  purpose  here 
the  reader  can  consider  them  to  be  similar. 


Consistent  Overlapping  Mode! 


Consistent  Global  Extension 


Consistent  Global  Evidence 


Figure  2:  Examples  of  the  global  consistency  SOUs  for  vehicle  monitoring. 

There  are  three  types  of  global  consistency  interactions  in  sensor  interpretation  problems:  interpretations  in  regions  of 
overlapping  interest  among  agents  must  be  consistent,  “continuous”  hypotheses  (e,g.,  vehicle  tracks)  that  would  extend 
into  other  agents’  areas  must  have  consistent  external  extensions,  and  hypotheses  that  require  evidence  that  could  be  in 
another  agent’s  area  (e.g.,  the  source  for  a  ghost  track  hypothesis)  must  have  appropriate  external  evidence.  Instances 
of  these  situations  can  be  detected  given  the  domain  model  and  knowledge  of  the  organization  of  agent  interest  areas. 
DRESUN  uses  three  global  consistency  SOUs  to  denote  instances  of  these  global  interactions:  consistent-overlapping-model^ 
consistent- global- extension,  and  consistent- global- evidence. 


completely  to  propagate  their  effects.  We  will  not 
explore  the  issues  involved  in  these  choices  here,  but 
see  (Carver  Sz  Lesser  1994). 

Representing  External  Evidence 

The  DRESUN  architecture  provides  the  basis  for  ef¬ 
fective  FA/C  problem  solving  because  it  explicitly  rep¬ 
resents  the  inter-agent  subproblems  interactions  and 
uses  this  information  to  drive  the  exchange  of  partial 
results  among  the  agents.  However,  our  initial  experi¬ 
mentation  with  DRESUN  found  that  there  were  some 
restrictions  on  the  coordination  strategies  that  could 
be  supported  because  of  an  inability  to  represent  the 
uncertainties  that  arise  when  using  incomplete  results 
information  and  results  information  at  multiple  levels 
of  abstraction.  This  section  will  try  to  make  clear  the 
differences  between  representing  local  evidence  and  ex¬ 
ternal  evidence,  by  focusing  on  the  problems  that  arise 
when  evaluating  the  effect  of  (both  consistent  and  in¬ 
consistent)  incomplete  external  evidence. 

First,  we  must  make  the  concept  of  global  consis¬ 
tency/inconsistency  more  precise.  In  interpretation 
problems,  data  and  hypotheses®  are  consistent  if  they 
can  be  merged  into  a  single  valid  interpretation  hy¬ 
pothesis  or  if  they  relate  only  to  completely  indepen¬ 
dent  top-level  hypotheses.  For  example,  two  vehicle 
track  hypotheses  that  overlap  in  time  are  consistent 
if  their  (imprecise)  vehicle  type  parameters  are  con¬ 
sistent,  if  their  (imprecise)  positions  intersect  at  the 
overlapping  times,  and  if  their  positions  for  the  non¬ 
overlapping  times  are  consistent  with  vehicle  move- 

®  When  we  talk  about  “hypotheses”  in  this  paper,  we  are 
really  referring  to  what  are  called  hypothesis  extensions  in 
RESUN/DRESUN  (Carver  Sz  Lesser  1991a). 


ment  constraints.  Consistency  checking  is  straightfor¬ 
ward  in  DRESUN.^ 

When  consistent  local  hypotheses  are  merged,  agents 
have  all  the  evidence  necessary  to  construct  a  complete 
new  hypothesis.  For  example,  in  the  consistent  local 
evidence  example  in  Figure  4,  the  supporting  data  of 
Ti  and  T2  can  be  used  to  create  a  new  hypothesis  73. 
Now,  consider  the  case  in  which  T2  is  an  external  hy¬ 
pothesis,  and  the  local  agent  does  not  have  (immedi¬ 
ate  access  to)  any  of  T2’s  supporting  evidence.  In  this 
case,  the  local  agent  can  also  create  a  new  hypothe¬ 
sis  T3,  which  has  the  same  attributes  (e.g.,  positions 
and  vehicle  ID)  as  the  Ts  created  in  the  local  evidence 
case.  However,  without  access  to  the  evidence  for  the 
external  72,  the  belief  in  this  T3  cannot  be  properly 
evaluated.  Evaluating  the  belief  in  T3  requires  knowl¬ 
edge  of  the  quality  of  the  data  for  each  of  supporting 
vehicle  positions,  but  alf  the  local  agent  has  access  to 
is  the  overall  belief  in  T2 — which  depends  on  the  qual¬ 
ity  of  the  data  from  the  overlapping  positions  as  well 
as  the  positions  that  extended  Ti.  While  the  belief 
in  T3  might  be  estimated  from  this  evidence  (assum¬ 
ing,  for  instance,  that  T2’s  overlap  data  is  of  about  the 
same  quality  as  Ti ’s),  the  resulting  belief  rating  will  be 
uncertain. 

One  of  the  characteristics  that  makes  sensor  inter¬ 
pretation  difficult  is  that  inconsistency  (i.e.,  alternative 
interpretations  of  the  data)  leads  to  complex  eviden¬ 
tial/belief  interrelationships  among  hypotheses.  In  the 

^DRESUN  requires  that  hypotheses  have  sufficient  at¬ 
tributes  to  be  able  to  judge  the  consistency  of  new  evidence 
without  having  to  have  access  to  all  the  existing  evidence 
for  the  hypotheses.  Thus,  the  consistency  of  two  track  hy¬ 
potheses  can  be  judged  without  requiring  access  to  their 
supporting  evidence. 


Agent  1 


Agent  2 


Figure  3:  An  example  of  the  resolution  of  a  global  consistency  SOU. 

When  there  is  a  consistent  explanation  in  the  external  agent,  resolution  of  the  global  SOU  associated  with  h\  results  in  the 
creation  of  a  merged  hypothesis  as  a  new  alternative  explanation  in  each  agent.  When  the  local  hypothesis  is  inconsistent 
with  hypotheses  in  the  external  agent,  new  alternatives  may  be  created  (as  shown  here).  When  the  local  hypothesis  is 
inconsistent  with  the  data  in  the  external  agent,  new  evidential  links  are  created  to  represent  the  contradictory  evidence. 


case  of  only  local  evidence,  these  relationships  can  at 
least  be  properly  evaluated.  For  example,  in  the  in¬ 
consistent  local  evidence  example  in  Figure  4,  suppose 
that  Ti  and  T2/  overlap  at  V3  and  V4,  but  are  inconsis¬ 
tent.  The  inconsistency  is  recognized  because  Ti  and 
T2/  are  alternative  explanations  for  the  shared  V3  and 
V4  support,  which  allows  the  negative  evidential  rela¬ 
tionship  between  Ti  and  T2/  to  be  evaluated  properly. 

Now,  consider  the  case  in  which  Ti  and  T2/  are  in¬ 
consistent,  but  T2*  is  an  external  hypothesis.  It  is  still 
straightforward  to  detect  the  inconsistency.  However, 
because  the  local  agent  does  not  have  any  of  T2/  ’s  sup¬ 
porting  evidence,  this  inconsistency  can  be  represented 
only  as  negative  evidence  for  Ti — which  again  makes 
it  impossible  to  precisely  evaluate  the  belief  in  T\, 
First,  the  effect  that  alternative  interpretations  have 
on  each  other’s  belief  depends  on  the  relative  belief  of 
the  shared  and  non-shared  portions.  For  instance,  if 
the  belief  in  T2/  is  largely  due  to  the  quality  of  the 
overlap  data,  then  does  not  represent  strong  be¬ 
lief  against  T\.  Second,  evidential  propagation  now 
does  not  automatically  reflect  correct  beliefs  if  there 
are  other  interrelated  hypotheses.  Suppose,  for  exam¬ 
ple,  that  there  are  additional  local  hypotheses  that  are 
inconsistent  with  Ti  (because  they  are  alternative  ex¬ 
planations  for  some  of  Ti ’s  support).  These  hypotheses 
may  also  be  inconsistent  with  the  external  T2/,  or  they 
may  be  consistent  with  it.  Unless  these  relationships 
are  explicitly  examined,  this  will  lead  to  additional  un¬ 
certainty  in  the  belief  in  Ti , 

This  brief  example  shows  that  communication  of  ab¬ 
stract,  incomplete  hypothesis  information  can  lead  to 


uncertainty  about  the  effect  of  external  evidence  on 
local  hypotheses.  Communicating  incomplete  informa¬ 
tion  can  still  be  useful,  however.  In  some  situations  the 
uncertainty  may  not  be  critical  to  resolve,  and  if  it  is,  it 
may  be  able  to  be  used  to  guide  further  communcation. 
We  have  addressed  this  and  a  number  of  other  repre¬ 
sentation  issues  in  our  extensions  to  DRESUN.  Several 
of  these  extensions  will  be  described  in  the  example  in 
the  next  section.  As  part  of  our  extension  of  the  model 
of  external  evidence,  we  have  given  DRESUN  agents 
the  ability  to: 

•  link  multiple  views  of  a  hypothesis  based  on  external 
evidence  at  different  levels  of  abstraction; 

•  link  multiple  hypothesis  extensions  that  are  being 
used  in  alternative  local  interpretations  (e.g.,  differ¬ 
ent  portions  of  a  single  external  vehicle  track  hy¬ 
pothesis); 

•  locally  create  alternative  versions  of  external  hy¬ 
potheses  based  on  incomplete  information  and  rep¬ 
resent  the  resulting  uncertainties; 

•  reformulate  hypotheses  for  more  efficient  exploration 
and  representation  of  alternatives; 

•  communicate  back  results  of  integrating  information 
that  was  sent  by  another  agent; 

•  avoid  circular  reasoning  when  exchanging  evidence 
among  agents; 

•  identify  when  shared  information  should  be  updated. 

Resolving  Global  Inconsistency: 

An  Example 

In  this  section,  we  will  return  to  the  example  of  Fig¬ 
ure  1.  Our  purpose  will  be  to  show  how  DRESUN ’s 
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Figure  4:  Examples  of  the  differences  in  representing  local  and 


extended  representation  of  external  evidence  provides 
the  flexibility  to  be  very  efficient  about  the  information 
that  must  be  communicated  among  agents  to  resolve 
global  inconsistencies.  For  the  purpose  of  our  presen¬ 
tation,  we  will  assume  that  agent  A  and  agent  B  have 
already  formed  their  local  track  hypotheses  (T^  and  T^) 
that  extend  through  the  overlap  region.  This  results 
in  consistent- overlapping-model  SOUs  being  posted  in 
each  agent’s  PS-Model.  We  also  will  assume  that  these 
SOUs  are  not  pursued  until  some  level  of  confidence  in 
the  track  hypotheses  is  reached  (based  on  the  local  ev¬ 
idence),  and  that  agent  A  is  the  first  to  communicate 
about  its  SOU. 

When  agent  A  starts  initiates  a  dialog  (with  agent 
B)  to  resolve  its  “overlap”  SOU,  there  are  two  options 
depending  on  whether  agent  A  thinks  the  bulk  of  the 
processing  to  check  consistency  should  be  done  by  itself 
or  by  agent  B:  it  could  request  agent  B  to  send  its  best 
interpretations  that  cover  the  overlap  region  and  then 
check  consistency  itself  or  it  could  send  track  Ta  to 
agent  B  and  let  that  agent  check  consistency.  Likewise, 
if  agent  A  chooses  to  send  Ta  it  hais  several  options  in 
terms  of  the  amount  of  detail  it  sends  about  Ta,  or  if 
agent  B  is  requested  to  send  back  its  interpretations 
it  has  several  representation  options.  Here,  we  make 
the  assumption  that  agent  A  will  handle  consistency 
checking  and  that  potential  solutions  will  initially  be 
communicated  at  their  most  abstract  level:  sending 
only  the  attributes  and  degree  of  belief  in  the  most 
likely  top-level,  track  hypotheses. 

Given  these  decisions,  agent  A  requests  that  agent 


B  send  it  any  relevant  potential  solutions  and  agent  B 
responds  with  track  Tj.  Agent  A  finds  that  track  Tf, 
is  inconsistent  with  its  own  track  Ta  since  the  tracks 
overlap  but  cannot  be  merged.  Because  T^  is  inconsis¬ 
tent  with  Ta,  negative  external  evidence  is  created  for 
Ta .  This  is  the  second  stage  of  agent  A’s  representation 
shown  in  Figure  5.  The  creation  of  this  negative  ex¬ 
ternal  evidence  will  cause  a  global-inconsistency  SOU 
to  be  added  to  agent  A’s  PS-Model.  Whether  or  not 
this  “inconsistency”  SOU  results  in  further  communi¬ 
cation  or  other  processing  depends  on  several  factors, 
including:  the  original  belief  in  Ta,  the  uncertainty 
abourthe  magnitude  of  the  (negative)  effect  that  T^ 
has  on  Ta  due  to  incomplete  information  about  the 
external  hypothesis  (as  described  in  the  Representing 
External  Evidence  Section),  the  ability  of  agent  A  to 
pursue  other  sources  of  uncertainty  in  Ta  (to  locally 
resolve  the  uncertainty  in  Ta),  the  general  classes  of 
uncertainty  affecting  other  hypotheses,  the  global  con¬ 
sistency  termination  criteria,  and  so  on. 

Assuming  that  the  agent  chooses  to  pursue  the  “in¬ 
consistency”  SOU,  it  first  identifies  plans  that  are  rel¬ 
evant  to  resolving  the  SOU.  One  plan  that  we  have 
developed  for  resolving  a  global-inconsistency  SOU  is 
applicable  only  when  the  inconsistency  involves  track 
hypotheses  that  are  partially  consistent.  This  plan  at¬ 
tempts  to  construct  a  new  extension  of  the  local  track 
that  is  consistent  with  a  portion  of  the  external  ev¬ 
idence.  Exactly  which  of  the  possible  alternatives  it 
initially  chooses  to  create  depends  on  the  information 
it  has  about  the  relative  credibility  of  the  various  por- 
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Figure  5:  Agent  A’s  evolving  representation  of  evidence  for  the  example. 


tions  of  the  inconsistent  tracks.  Here,  agent  A  knows 
about  the  credibility  of  portions  of  only  its  own  track 
Tq:  the  support  from  times  5  through  11  is  quite  strong 
and  that  from  times  2  through  4  is  weak  (see  Figure  1). 
Starting  with  the  consistent  portion  of  Ta  at  times  5 
and  6,  and  the  better  supported  portion  from  times 
7  through  11,  agent  A  decides  to  create  a  new  track 
extension  using  local  evidence  from  times  5  through 
11, 

In  the  third  stage  of  Figure  5,  agent  A  has  created 
track  Ta^b  based  on  (5  — 11)  (one  of  the  intermediate 
extensions  of  Ta)  and  Ti(l  — 6).  T6(l  — 6)  is  an  interme¬ 
diate  extension  of  the  external  track  Tj  that  has  been 
locally  hypothesized  by  agent  A — i.e.,  agent  A  has  cre¬ 
ated  this  extension  without  knowing  whether  agent  B 
has  a  representation  of  this  version  of  and  without 
knowing  the  degree  of  belief  in  this  portion  of  T^.^  Be¬ 
cause  agent  A  lacks  both  the  supporting  evidence  and 


®  Agent  B  may  or  may  not  have  created  this  particular 
extension  depending  on  how  it  gathered  the  evidence  to 
construct  T6(l  —  10).  Agent  B  may  have  to  reformulate  its 
view  of  Tb  in  order  to  create  T6(l  -  6). 


the  overall  belief  rating  for  T6(l  -  6),  there  is  consid¬ 
erable  uncertainty  about  the  belief  rating  for  Ta+h  (re¬ 
member,  it  has  only  the  overall  rating  for  Tb{l  -  10)). 
The  reasons  for  its  uncertainty  are  represented  by  an 
SOU  that  is  posted  with  Ta+6. 

Assuming  that  agent  A  decides  to  pursue  Ta+6  fur¬ 
ther,  since  it  is  a  credible  globally  consistent  solution, 
this  SOU  drives  the  selection  of  a  plan  that  requests 
agent  B  to  communicate  the  support  belief  summaries 
for  Tfc(l  —  6),^  When  the  requested  information  is  re¬ 
ceived,  it  is  integrated  into  agent  A’s  incomplete  rep¬ 
resentations  of  Tb{l  -  6)  and  Ta+fe.  The  result  of  this 
process  is  shown  in  the  final  stage  of  Figure  5.  With 
this  level  of  information,  agent  A  can  evaluate  the  like¬ 
lihood  of  Ta+6  (given  the  evidence  gathered  so  far  by 
itself  and  agent  B).  Depending  on  the  results  of  this 
evaluation  and  the  termination  criteria,  agent  A  may 
then  consider  Ta^b  to  be  a  (likely)  solution  or  to  not  be 
a  solution,  or  it  may  need  to  try  to  gather  additional 


®This  information  is  provided  in  our  application  by  the 
belief  ratings  at  the  vehicle  (position)  level  and  in  the 
uncertain- support  SOUs  at  the  track  level. 


evidence  to  resolve  the  remaining  uncertainty. 

This  example  shows  how  D RESUN  agents  can  carry 
on  dialogs  in  order  to  resolve  global  inconsistencies.  It 
also  shows  that  these  dialogs  can  be  directed,  using 
information  at  appropriate  levels  of  detail,  in  order  to 
limit  the  amount  of  information  that  must  be  com¬ 
municated  (and  integrated).  In  this  example,  agent 
A  does  not  have  to  have  complete  knowledge  of  T&’s 
supporting  evidential  structure  or  its  raw  data.  All 
that  is  needed  is  information  about  the  “quality”  of 
the  data  sets  supporting  T^.  In  fact,  because  agent  A 
was  able  to  to  construct  alternative  hypotheses  based 
on  its  local  data  and  an  incomplete  view  of  Tj,  it  was 
able  to  limit  the  information  it  required  to  just  a  por¬ 
tion  of  Tfc’s  support.  The  flexibility  to  do  this  sort  of 
local  processing  is  possible  because  DRESUN  agents 
represent  the  uncertainty  that  results  from  the  use  of 
incomplete  external  evidence. 

Conclusions 

In  this  paper,  we  have  discussed  some  of  the  issues 
that  can  arise  in  a  DSA  system  when  sharing  incom¬ 
plete  or  inconsistent  information,  or  information  at 
different  levels  of  detail.  These  are  important  issues 
since  the  sharing  of  partial  results  is  a  critical  element 
of  an  FA/C  approach  to  distributed  problem  solving. 
We  have  also  shown  how  the  agent  architecture  of  the 
DRESUN  testbed  has  been  extended  to  give  the  agents 
the  flexibility  to  communicate  information  in  a  very  di¬ 
rected  manner.  These  extensions  have  focused  on  rep¬ 
resenting  the  uncertainties  that  occur  when  DRESUN 
agents  exchange  incomplete  information,  determining 
precisely  what  information  is  needed  to  resolve  global 
inconsistencies,  and  reformulating  hypotheses  to  more 
efficiently  pursue  alternative  interpretations. 

Because  DRESUN  supports  a  range  of  methods  for 
resolving  interpretation  uncertainty  and  global  incon¬ 
sistency,  coordination  strategies  must  consider  a  va¬ 
riety  of  questions  about  whether/when/how  to  pursue 
interpretations  and  SOUs  (the  example  in  the  previous 
section  mentioned  a  number  of  options  faced  by  the 
agents).  We  are  pursuing  both  analytical  and  experi¬ 
mental  approaches  to  determine  appropriate  coordina¬ 
tion  strategies  (Decker  &  Lesser  1995),  are  analyzing 
the  quality  of  solutions  that  can  be  produced  by  FA/C- 
based  DSA  systems  (Carver  &  Lesser  1994),  and  are 
developing  methods  for  analyzing  the  inherent  com¬ 
plexity  of  interpretation  scenarios  (Whitehair  k  Lesser 
1993).  Since  it  is  difficult  to  evaluate  a  framework  in¬ 
dependently  of  the  strategies  that  are  encoded  within 
it,  the  development  of  suitable  coordination  strategies 
is  a  major  focus  of  our  current  research. 
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Abstract 

When  solving  real-time  AI  problems,  it  is  "uften  useful  to  have  multiple 
methods  available  for  solving  subproblems.  This  allows  acceptable  solutions  to  be 
found  even  in  tight  deadline  situations.  However,  it  requires  that  problem  solvers 
have  both  a  simple  planning  component  (to  decide  which  methods  to  use)  and 
a  scheduling  component  (to  schedule  the  execution  of  the  chosen  methods).  In 
this  paper  we  describe  the  design-to-time  approach  to  real-time  problem  solving 
that  uses  this  integration  of  planning  and  scheduling  to  find  solutions  to  hard 
real-time  problems.  We  describe  the  design-to-time  algorithm  and  provide  some 
experimental  results  that  show  the  usefulness  of  this  algorithm. 
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1  Introduction 


In  our  work  on  real-time  artificial  intelligence  (AI)  we  have  found  it  useful  to  integrate 
a  simple  form  of  planning  with  scheduling  to  find  good  solutions.  Planning  is  involved 
because  multiple  (often  approximate)  methods  are  available  for  many  subproblems  and 
it  is  necessary  to  decide  which  methods  to  execute.  Having  multiple  methods  available 
allows  solutions  to  be  found  for  a  broad  range  of  environmental  situations.  Scheduling 
is  involved  because  there  are  hard  deadlines  by  which  subproblems  must  be  completed, 
so  it  is  necessary  to  understand  when  particular  methods  will  complete  execution  and 
what  results  can  be  expected. 

Steve  Smith  has  talked  about  the  usefulness  of  integrating  planning  and  scheduling 
in  complex,  resource-constrained  problems  [9].  He  discusses  the  need  to  combine 
deciding  “what”  to  do  (i.e.,  finding  ways  to  satisfy  goals  from  more  basic  descriptions 
of  how  domain  actions  affect  the  world)  with  “when”  to  do  it  (i.e.,  allocating  resources 
over  time  to  tasks  in  a  way  that  maximizes  overall  system  performance).  While  our 
approach  is  quite  different  from  Smiths,  the  basic  idea  of  integrating  planning  and 
scheduling  to  solve  problems  is  shared. 

Our  approach  to  real-time  problem  solving  is  known  as  design- to-time  [5,  6]  and 
integrates  planning  and  scheduling  to  solve  real-time  problems  where  multiple  methods 
are  available  for  many  subproblems.  Our  integration  differs  from  Smiths  in  that  our 
planning  and  scheduling  both  address  different  kinds  of  problems  in  different  ways.  Our 
simple  planner  has  to  decide  which  of  many  combinations  of  solution  methods  to  use  to 
solve  each  problem,  rather  than  determining  from  first  principles  what  operators  could 
be  applied  in  each  situation.  Unlike  Smith,  our  scheduler  does  not  schedule  resources 
(other  than  a  CPU)  and  it  schedules  a  single  line  of  control.  This  differentiates  our 
work  from  much  of  the  standard  AI  scheduling  work  [8,  3,  7],  although  (as  discussed 
at  the  end  of  the  paper)  we  have  begun  to  extend  our  scheduling  mechanism  to  handle 
multiple  lines  of  control  and  other  resources.  Figure  1  is  a  high  level  depiction  of  the 
design-to-time  algorithm. 

In  design-to-time  we  frame  the  input  problem  in  a  generic  representation  of  task 
structures  known  as  TJEMS  [2,  1].  In  T^MS  we  can  describe  problems  in  terms  of 
how  quality  is  incrementally  accumulated  over  time,  what  soft  and  hard  interactions 
there  are  between  tasks,  and  how  much  uncertainty  there  is  in  method  quality  and 
duration.  Figure  2  shows  an  example  of  a  simple  Ti^MS  task  structure,  in  this  case  one 
that  describes  the  problem  of  making  coffee.  Note  that  there  are  multiple  methods  for 
solving  each  of  the  three  subtasks  of  the  main  problem. 

The  T/EMS  framework  gives  us  what  might  be  called  a  partially  digested  description 
of  a  problem.  We  do  not  have  to  plan  completely  from  scratch,  deciding  what  operators 
apply  in  a  given  situation  and  how  they  can  be  combined  to  solve  problems.  That 
information  is  available  to  us  in  the  TJEMS  task  structure.  What  we  need  to  do 
is  determine  which  of  the  many  options  available  in  that  task  structure  we  should 
actually  pursue.  Because  we  are  dynamically  combining  options  together  (rather  than 
using  predefined  process  plans)  T/EMS  needs  to  explicitly  represent  the  soft  interactions 
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between  tasks  that  could  normally  be  built  in  to  the  process  plan  for  a  task,  because  the 
exact  effect  of  the  interactions  depends  on  the  context  in  which  the  tasks  are  executed. 


duration:  10  min  duration:  10  min 


Figure  2:  An  example  of  TMMS  task  structure  to  be  scheduled.  The  black  lines  represent 
task/subtask  connections,  the  thin  gray  line  represents  a  facilitates  relationship  and  the 
thick  gray  lines  represent  enables  relationships. 

In  a  Ty^MS  task  structure  the  leaves  of  the  graph  represent  executable  computations 
(known  as  methods)  and  the  nonleaf  nodes  represent  tasks  that  achieve  quality  as  a 
function  of  the  quality  of  their  subtasks.  Each  separate  graph  is  known  as  a  task 
group  and  represents  a  single  independent  problem  to  be  solved.  Each  task  group  has  a 
deadline  by  which  all  computation  on  that  task  group  must  be  completed.  Non-parent- 
child  connections  between  tasks  and  methods  represent  interactions,  such  as  enables 
(Task  A  must  have  quality  greater  than  a  threshold  before  Method  B  can  correctly  begin 
execution)  and  facilitates  (if  Task  A  has  quality  greater  than  a  threshold,  then  Method 
B  will  have  reduced  duration  and/or  increased  quality). 

The  goal  of  our  design-to-time  scheduling  algorithm  is  to  maximize  the  overall 
quality  achieved.  What  quality  means  depends  ^-on  the  particular  application,  but 
components  often  include  completeness  (a  more  complete  answer  "usually  has  higher 
value),  certainty  (increasing  the  likelihood  that  a  result  is  correct  can  increase  its  quality), 
accuracy  (a  more  accurate  answer  often  has  higher  quality)  and  cost  (the  quality  of  a 
solution  could,  for  example,  be  inversely  related  to  the  cost,  where  cost  is  a  measure  of 
non-CPU  resources  required  for  a  solution). 

In  the  coffee  example  from  Figure  2  overall  quality  for  the  task  group  is  the  minimum 
of  the  qualities  achieved  by  acquiring  beans,  grinding  and  brewing.  Acquiring  beans 
enables  grinding.  You  have  to  have  the  beans  before  you  can  grind  them.  Grinding 
enables  brewing.  You  cant  actually  brew  coffee  until  you  have  ground  the  beans. 
Each  of  these  intermediate  tasks  has  two  different  methods  for  achieving  it  that  have 
different  expected  quality  and  duration  values.  Using  Yuban  from  the  freezer  reduces 
the  expected  duration  of  the  grinding  task  (because  presumably  the  Yuban  is  already 
ground,  although  it  might  be  desirable  to  grind  it  more  finely).  This  translates  to  a 
facilitates  in  T/EMS. 

Another  issue  that  can  arise  in  problems  of  this  form  is  uncertainty.  Often  we 
might  only  approximately  know  how  long  a  particular  solution  method  will  take  or 
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what  quality  of  solution  to  expect.  Maybe  there  will  be  a  big  line  at  Starbucks  today 
(increased  duration)  or  maybe  they  will  be  out  of  our  favorite  variety  and  we  will  have 
to  settle  for  something  else  (reduced  quality). 

Given  an  input  problem  of  this  form,  the  design-to-time  algorithm  first  generates 
a  set  of  alternatives,  are  unordered  combinations  of  executable  methods  that  it 

thinks  might  be  good  for  solving  the  problem.  These  alternatives  are  chosen  without 
taking  interactions  and  deadlines  into  account  (because  taking  them  into  account 
requires  actual  scheduling,  which  is  the  next  step  of  the  algorithm).  These  alternatives 
are  generated  using  a  dynamic  programming  technique  and  represent  a  good  cross- 
section  of  possible  solutions  with  different  resource  (in  this  case  CPU)  requirements. 
This  is  the  planning  part  of  the  algorithm  that  is  choosing  which  of  many  possible 
combinations  of  methods  are  worth  considering  in  detail. 

Some  of  the  alternatives  that  would  be  generated  for  the  coffee  making  example 
include: 

Yuban-grinder-boil  expected  quality:  3  (maximum  of  3,  7,  5),  expected  duration:  5 
min.  40  sec. 

Starbucks-mill-drip  expected  quality:  10,  expected  duration:  40  min. 
Starbucks-grind-boil  expected  quality:  5,  expected  duration:  25  min.  30  sec. 
Starbucks-grind-drip  expected  quality:  7,  expected  duration:  30  min.  30  sec. 

After  the  initial  set  of  alternatives  is  generated  the  algorithm  goes  into  a  loop  of 
choosing  an  alternative  to  schedule,  scheduling  the  chosen  alternative  and  evaluating  the 
resulting  schedule,  which  may  result  in  new  alternatives  being  generated.  Choosing  an 
alternative  to  schedule  is  done  in  a  heuristic  manner  that  takes  into  account  deadlines, 
the  best  schedules  found  so  far  and  the  expected  results  of  the  alternatives.  A  schedule 
is  produced  for  an  alternative  using  a  greedy  heuristic  approach  takjng  into  account 
factors  such  as  expected  quality  and  duration,  whether  a  method  interacts  with  other 
methods,  and  deadline.  Evaluation  includes  both  checking  the  new  schedule  to  see  if  it 
is  better  than  the  best  schedule  currently  available  (and  if  it  is  remembering  it),  as  well 
as  looking  for  ways  to  improve  the  schedule.  For  example,  if  a  method  in  the  alternative 
could  not  be  scheduled  because  it  was  not  enabled,  a  new  alternative  will  be  generated 
that  includes  all  of  the  methods  in  the  scheduled  alternative  plus  a  method  or  methods 
to  enable  the  unenabled  method. 

In  the  simple  coffee  example  it  is  possible  to  enumerate  all  reasonable  schedules 
and  optimally  pick  the  best  one  (which,  of  course,  depends  on  the  deadline).  However, 
in  general,  when  there  are  m  methods,  there  are  2^  possible  combinations  of  these 
methods  (the  power  set  of  m).  Each  of  these  combinations  has  more  than  z!  possible 
orderings  where  i  is  the  number  of  methods  in  a  particular  combination.  A  problem 
with  that  kind  of  exponential  growth  clearly  requires  a  heuristic  solution  technique. 

In  the  remainder  of  this  paper  we  first  describe  our  design-to-time  scheduling  al¬ 
gorithm,  including  its  mechanisms  for  handling  task  interactions  and  uncertainties. 
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Then  we  describe  the  results  of  experiments  run  to  understand  under  what  environ¬ 
mental  conditions  the  design-to-time  approach  could  be  useful.  Finally  we  draw  some 
conclusions  and  discuss  future  work. 

2  Heuristic  Design-to-time  Scheduling  Algorithm 

Our  heuristic  design-to-time  scheduling  algorithm  consists  of  three  main  components: 
a  simple  planner  that  generates  alternatives  (unordcred  combinations  of  executable 
methods),  a  scheduler  that  assigns  start  and  finish  times  to  the  methods  from  an 
alternative,  and  an  evaluator  that  both  remembers  the  best  schedules  found  so  far  and 
looks  for  ways  to  improve  the  current  schedule.  Figure  1  provides  a  graphical  picture 
of  the  connected  components  of  this  algorithm. 

That  the  algorithm  consists  of  these  three  particular  components  is  because  they 
help  to  reduce  the  overall  complexity  of  the  problem  to  manageable  levels.  Initial 
planning  is  done  at  a  higher  level  of  abstraction,  ignoring  interactions  between  tasks 
and  deadlines,  to  allow  us  to  feasibly  consider  a  broad  cross-section  of  possible  solution 
plans.  Scheduling  is  done  on  only  those  plans  that  make  it  through  the  first  level  of  the 
algorithm,  because  completely  scheduling  all  possible  plans  would  be  computationally 
impossible  in  most  situations.  Evaluation  is  done  at  the  end  to  reduce  the  likelihood 
that  we  are  missing  good  plans  by  suggesting  plans  that  are  simple  variations  on  existing 
plans  that  could  have  increased  value. 

The  input  to  the  algorithm  is  a  T^EMS  task  structure  that  describes  the  set  of  problems 
to  be  solved,  the  interactions  within  those  problems,  uncertainties  about  the  expected 
quality  and  duration  of  methods,  and  the  deadlines  by  which  problems  must  be  solved. 
An  example  of  such  as  task  structure  is  shown  in  Figure  2. 

The  first  step  of  the  algorithm  is  to  generate  a  set  of  alternatives  for  solving  the 
overall  problem.  This  done  by  recursively  generating  alternatives  from  the  leaves  of  the 
TJEMS  task  structure  up  to  the  root.  For  a  standard  method  there  is  just  one  alternative, 
containing  the  method,  with  the  expected  quality  and  duration  of  the  method.  Our 
algorithm  also  supports  anytime  methods  (i.e.,  methods  that  are  anytime  algorithms 
that  always  have  a  result  at  hand  and  return  higher  quality  results  as  they  are  given 
more  execution  time,  up  to  some  bound).  For  anytime  methods  we  generate  a  small 
fixed  number  of  alternatives  by  choosing  points  on  the  performance  profile  (a  function 
mapping  duration  to  expected  quality)  of  the  anytime  method.  These  points  are  chosen 
in  a  way  that  minimizes  the  difference  in  estimated  quality  and  duration  between  the 
chosen  points. 

For  higher  level  tasks  in  the  task  structure  alternatives  are  generated  from 

the  alternatives  for  subtasks,  depending  on  the  quality  accumulation  function  of  the 
task.  The  goal  is  to  generate  plans  with  different  resource  requirements  to  allow 
planning  at  the  next  level  to  have  a  good  cross-section  of  choices.  This  is  done  using  a 
dynamic  programming  approach  that  finds  the  highest  expected  quality  combination 
of  alternatives  for  subtasks,  for  each  possible  duration  for  the  task.  Note  that  the 
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estimated  duration  and  quality  associated  with  an  alternative  are  often  not  accurate 
because  they  do  not  take  interactions  between  tasks  into  account.  To  fully  take  these 
interactions  into  account  requires  scheduling,  which  we  dont  do  at  this  stage  of  the 
algorithm,  because  it  would  be  much  too  computationally  expensive.  The  resulting  list 
of  alternatives  for  a  task  is  pruned  to  a  manageable  number  and  becomes  the  input  for 
generating  the  alternatives  for  its  supertask. 

Once  a  set  of  alternatives  has  been  generated  for  the  overall  set  of  problems  to  be 
solved,  the  next  set  of  steps  in  the  algorithm  begin.  This  is  a  loop  that  first  chooses 
an  alternative  to  schedule,  then  schedules  that  alternative,  then  evaluates  the  resulting 
schedule  possibly  generating  additional  alternatives.  This  loop  continues  until  either 
no  known  alternative  could  produce  a  better  schedule  that  the  best  current  schedule  or 
all  alternatives  are  exhausted  or  a  threshold  number  of  alternatives  has  been  scheduled. 

The  next  alternative  to  schedule  is  chosen  using  a  heuristic  algorithm  with  the 
following  priorities: 

1 .  If  no  schedules  have  been  found,  choose  the  fastest  alternative  that  would  achieve 
quality  for  all  problems. 

2.  If  the  best  schedule  found  so  far  achieves  a  total  quality  of  q  and  has  total  duration 
d,  choose  alternatives  that  could  achieve  quality  greater  than  q  or  could  achieve 
at  least  quality  q  in  duration  less  than  d. 

Dynamic  duration  thresholds  are  used  to  limit  the  set  of  alternatives  considered  to  those 
with  an  estimated  duration  below  a  threshold.  This  threshold  is  set  depending  on  the 
estimated  duration  of  the  alternative  for  the  best  schedule  found  so  far,  on  how  long 
we  have  been  scheduling  since  finding  a  new  schedule,  and  a  few  other  considerations. 

The  next  step  is  to  actually  schedule  the  chosen  alternative.  The  algorithm  for  doing 
this  is  very  simple.  Remember  that  an  alternative  is  just  an  unordered  set  of  methods. 
Scheduling  is  done  by  rating  each  method  using  a  set  of  heuristics.  The  highest  rated 
method  is  added  to  the  schedule,  then  the  remaining  methods  are  rated  again.  This 
continues  until  all  methods  are  scheduled  or  no  method  is  rated  as  acceptable  for  adding 
to  the  schedule.  Heuristics  include  preferring  methods  with  earlier  deadlines,  rating  as 
unacceptable  any  method  that  is  not  enabled,  and  preferring  methods  that  enable  other 
methods. 

Once  a  schedule  has  been  produced,  it  is  evaluated.  If  it  is  better  than  the  best 
current  schedule,  it  becomes  the  best  schedule.  The  evaluation  criteria  include  how 
many  problems  are  solved,  the  total  quality  of  the  solutions  (higher  quality  preferred), 
the  total  duration  of  the  schedule  (shorter  duration  preferred),  and  the  likelihood  that 
the  schedule  will  perform  as  expected  (higher  certainty  preferred). 

Schedules  are  also  looked  at  by  various  improvement  operators  that  try  to  find  ways 
to  improve  the  schedule.  For  example,  one  improvement  notices  when  methods  in 
an  alternative  could  not  be  scheduled  because  they  were  not  enabled  and  adds  new 
methods  to  the  alternative  to  enable  those  methods.  This  new  alternative  is  added 
to  the  list  of  alternatives  that  is  considered  for  scheduling.  Once  a  schedule  has  been 
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considered  for  improvements,  we  go  back  to  the  beginning  of  the  loop,  choose  another 
alternative  to  schedule  and  continue. 

3  Experimental  Results 

These  experiments  have  two  main  goals.  One  is  to  show  the  usefulness  of  the  integration 
of  planning  and  scheduling.  The  other  is  to  examine  whether  the  environmental  param¬ 
eters  that  we  consider  most  important  (interactions  between  tasks  and  the  availability 
of  multiple  methods)  in  fact  can  lead  to  improved  system  performance. 

One  of  the  useful  features  of  the  T.EMS  task  structure  environment  is  that  it  is 
possible  to  generate  many  random  problems  for  experimentation.  T/EMS  also  supports 
easy  replication  of  environments  with  just  those  parameters  under  study  being  varied, 
allowing  us  to  do  matched  pair  experiments.  This  means  that,  for  example,  when  we 
vary  the  likelihood  of  facilitates,  only  the  number  of  facilitates  changes,  all  other  aspects 
of  the  task  structure  remain  the  same  for  a  set  of  runs.  In  the  experiments  described 
here  100  different  environments  were  generated,  then  the  parameters  of  interest  were 
varied  within  those  environments. 

The  first  experiment  looks  at  the  importance  of  the  planning  component  of  our 
scheduling  algorithm  in  the  overall  performance  of  the  algorithm.  In  this  experiment 
we  examine  the  performance  of  our  scheduling  algorithm  using  our  standard  planning 
component  compared  to  the  performance  after  replacing  that  component  with  a  random 
alternative  generator  and  with  a  generator  that  generates  one  alternative  containing  all 
methods.  The  random  alternative  generator  generates  alternatives  of  a  fixed  size  by 
randomly  choosing  methods.  The  expectation  is  that  use  of  the  planner  will  significantly 
improve  overall  system  performance.  Figure  3  shows  proportion  of  expected  quality 
versus  deadline  parameter  for  the  three  algorithm  variations  studied.  As  the  deadline 
parameter  gets  higher,  deadlines  get  looser  allowing  more  time  to  be  spent  executing 
tasks.  As  this  graph  shows,  the  scheduler  with  the  standard  planning  component 
outperforms  the  other  two  schedulers.  A  two-way  analysis  of  variance  (ANOVA)  shows 
significant  main  effects  for  both  the  deadline  parameter  (F  =  194.5,  p  <  0.00001)  and 
which  alternative  generator  is  used  (F  =  95.0,  p  <  0.00001),  as  well  as  an  interaction 
effect  between  the  two  (F  =  9.9,  p  <  0.00001).  Our  interpretation  of  the  interaction 
effect  IS  that  as  deadlines  become  looser,  the  value  of  the  planning  mechanism  becomes 
more  important,  because  it  allows  facilitates  to  be  fully  taken  advantage  of  The  overall 
result  of  this  experiment  is  that  the  planning  component  leads  to  significantly  better 
performance  under  all  situations  studied,  and  that  the  effect  on  performance  becomes 
more  positive  as  deadlines  become  looser. 

The  remaining  experiments  look  at  the  effect  of  different  environmental  parameters 
on  the  performance  of  the  system.  In  the  first  experiment  we  varied  the  number  of  leaf 
methods  created  at  the  points  in  the  task  structure  generation  process  where  methods 
are  created.  As  additional  methods  are  added  they  are  given  expected  quality  and 
duration  as  a  proportion  of  the  quality  and  duration  of  the  first  method  created.  That 
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Figure  3:  Deadline  parameter  setting  versus  mean  proportion  of  expected  quality  for 
three  scheduler  configurations. 


is,  the  first  method  has  quality  q  and  duration  d.  If  there  are  two  methods,  the  second 
one  gets  quality  q/2  and  duration  d/2.  If  there  are  5  methods,  they  get  qualities 
of  q/by  2q/by  3q/5y  4g/5,  and  q  with  the  corresponding  durations.  Figure  4  shows 
the  effect  of  varying  the  number  of  leaf  methods  on  the  quality  achieved.  Quality  is 
measured  as  a  proportion  of  the  expected  quality  (the  maximum  quality  that  could 
be  achieved  without  deadlines  or  interactions  between  tasks).  The  expectation  is  that 
adding  more  leaf  methods  will  result  in  more  options,  allowing  higher  quality  to  be 
achieved,  especially  at  tighter  deadline  values.  As  figure  4  shows,  this  appears  to  be  the 
case.  Using  the  data  from  this  experiment  a  two-way  ANOVA  shows  a  significant  main 
effect  for  the  number  of  leaf  methods  (F  =  11.8,  p  <  0.00001).  The  effect  of  facilitates 
likelihood  is  not  significant  (F  =  1.66,  p  <  0.19)  an^  no  interaction  effect  is  detected. 
The  overall  result  of  this  experiment  is  that  adding  additional  methods  (at  least  up  to 
the  total  of  3  or  4)  significantly  improves  system  performance. 

In  the  second  experiment  we  looked  at  the  effect  of  varying  the  likelihood  of 
facilitates.  As  part  of  task  structure  generation,  facilitates  are  added  to  the  task  structure 
depending  on  the  likelihood  of  facilitates.  In  this  experiment  we  varied  facilitates 
likelihood  from  0  (no  facilitates)  to  1  (facilitates  at  each  place  where  one  is  possible). 
Figure  5  shows  the  proportion  of  expected  quality  (as  in  the  previous  experiment) 
versus  the  facilitates  likelihood  for  four  values  of  a  deadline  parameter  at  high  facilitates 
strength.  Facilitates  strength  refers  to  the  power  of  the  effect  of  the  facilitates  on  quality 
and  duration.  The  expectation  is  that  more  facilitates  will  allow  higher  quality  to  be 
achieved  both  because  facilitates  directly  increase  the  quality  of  facilitated  methods  and 
because  durations  are  reduced,  possibly  allowing  more  tasks  to  be  completed.  Using 
the  data  from  this  experiment  we  ran  a  two-way  ANOVA.  It  showed  a  significant  main 
effect  of  both  deadline  parameter  (F  =  58.7,  p  <  0.00001)  and  facilitates  likelihood  (F 
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Figure  4:  Number  of  leaf  methods  versus  proportion  of  expected  quality  for  three  values 
of  facilitates  likelihood  with  tight  deadlines 
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Figure  5:  Facilitates  likelihood  versus  proportion  of  expected  quality  for  four  values  of 
a  deadline  parameter  at  high  facilitates  strength. 
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=  9.7,  p  <  0.00001).  No  significant  interaction  effect  was  detected. 

One  interesting  question  that  can  be  asked  about  the  result  of  this  experiment 
is  if  the  positive  effect  of  the  facilitates  happens  specifically  because  the  scheduling 
algorithm  takes  facilitates  into  account  or  whether  the  effect  would  remain  even  if  the 
algorithm  did  not  pay  attention  to  the  facilitates.  We  examined  this  question  by  adding 
a  parameter  that  allows  the  visibility  of  facilitates  to  the  scheduler  to  be  varied  separate 
from  the  likelihood  of  facilitates  appearing.  Figure  6  shows  the  proportion  of  expected 
quality  versus  facilitates  visibility  for  five  values  of  facilitates  likelihood  at  high  facilitates 
strength.  The  facilitates  visibility  parameter  affects  the  likelihood  of  a  facilitates  being 
detected  by  the  scheduler,  but  does  not  affect  its  effect  as  methods  are  executed.  The 
expectation  is  that  as  the  likelihood  that  the  scheduler  sees  facilitates  increases,  the 
quality  increases,  because  the  scheduler  will  be  able  to  schedule  to  take  advantage 
of  facilitates  (as  opposed  to  taking  advantage  of  them  inadvertently).  As  figure  6 


Facilitates  likelihood:  0 
Facilitates  likelihood:  0.25 
Facilitates  likelihood:  0.5 
Facilitates  likelihood:  0.75 
Facilitates  likelihood:  1 


Figure  6:  Likelihood  of  facilitates  visibility  versus  mean  proportion  of  expected  quality 
for  deadline  parameter  setting  0.5  and  high  facilitates  power. 


shows,  quality  increases  significantly  as  facilitates  visibility  increases,  particularly  at 
higher  facilitates  likelihoods.  This  shows  up  in  our  two-way  AN OVA  test,  which 
shows  significant  main  effects  for  both  facilitates  visibility  (F  =  18.9,  p  <  0.00001)  and 
facilitates  likelihood  (F  =  7.0,  p  <  0.00001),  as  well  as  nearly  showing  an  interaction 
effect  between  the  two  (F  =  1.79,  p  <  0.075). 


4  Conclusions 

In  solving  real-time  AI  problems  it  can  be  useful  to  combine  a  simple  planning  mecha¬ 
nism  with  a  scheduler  to  find  problem  solutions.  Planning  is  useful  for  choosing  which 
particular  methods  to  combine  to  solve  the  problem  at  hand  depending  on  how  those 
methods  interact  and  how  tight  the  deadlines  are.  Scheduling  is  necessary  to  allow  a 
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clear  understanding  of  when  and  how  particular  subproblems  will  be  solved.  We  have 
shown  the  design-to-time  real-time  scheduling  algorithm  for  solving  problems  of  this 
sort  by  combining  a  simple  planning  mechanism  with  a  scheduler. 

Through  experimentation  we  demonstrated  that  several  factors  have  a  significant 
effect  on  the  performance  of  the  system.  Having  more  methods  available  leads  to 
increased  quality.  Having  more  facilitates  to  take  advantage  of  also  leads  to  increased 
quality.  The  quality  increase  is  higher  when  facilitates  are  visible  to  the  scheduler 
than  when  they  are  hidden  from  it  but  still  present.  Our  simple  planning  mechanism 
outperforms  the  random  choosing  of  methods  and  choosing  all  of  the  methods  (forcing 
the  scheduling  component  to  choose  which  methods  to  schedule). 

We  have  began  to  extend  our  approach  to  allow  us  to  solve  problems  where  non- 
CPU  resources  exist  and  multiple  threads  of  control  must  be  scheduled  (e.g.,  parallel 
or  distributed  problem  solving).  T/£MS  supports  the  representation  of  these  resources, 
including  resources  that  can  be  used  by  one  thread  of  control  at  a  time  (such  as 
machines  in  a  job  shop)  and  multiple  access  resources  (such  as  a  shared  network  whose 
performance  degrades  as  its  usage  increases).  Such  an  extension  would  allow  us  to  solve 
a  broader  class  of  time-constrained  problems. 

This  paper  describes  only  a  portion  of  the  work  that  has  been  done  on  the  design- 
to-time  approach  to  problem  solving.  Other  areas  that  have  been  investigated  include 
uncertainty  in  method  quality  and  duration,  monitoring  of  method  execution,  embed¬ 
ding  a  scheduler  in  a  larger  application,  and  a  significantly  larger  set  of  experiments 
that  study  the  usefulness  of  the  approach.  Further  details  on  this  work  can  be  found  in 
the  first  authors  dissertation  [4]. 
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Abstract 

In  naturally  distributed,  homogeneous,  cooperative  problem  solving  environments  where  well- 
defined  tasks  arrive  at  multiple  locations,  decisions  must  be  made  about  the  extent  of,  and  overlap 
between,  each  agents  area  of  responsibility — the  agents’  organization.  The  organization  may  be  con¬ 
structed  statically  by  a  system  designer,  or  dynamically  by  the  agents  during  problem  solving.  No  one 
organization  is  optimal  across  environments  or  even  specific  problem  solving  instances  [6,  7,  8]. 

This  paper  presents  an  analysis  of  static  and  dynamic  organizational  structures  for  this  class  of 
environments,  exemplified  by  distributed  sensor  networks.^  We  first  show  how  the  performance  of 
any  static  organization  can  be  statistically  described,  and  then  show  under  what  conditions  dynamic 
organizations  do  better  and  worse  than  static  ones.  Finally,  we  show  how  the  variance  in  the  agents’ 
performance  leads  to  uncertainty  about  whether  a  dynamic  organization  will  perform  better  than  a  static 
one  given  only  agent  a  priori  expectations.  In  these  cases,  we  show  when  meta-level  communication 
about  the  actual  state  of  problem  solving  will  be  useful  to  agents  in  constructing  a  dynamic  organizational 
structure  that  outperforms  a  static  one.  Viewed  in  its  entirety,  this  paper  also  presents  a  methodology  for 
answering  questions  about  the  design  of  distributed  problem  solving  systems  by  analysis  and  simulation 
of  the  characteristics  of  a  complex  environment  rather  than  by  relying  on  single-instance  examples. 


^Portions  of  this  technical  report  appeared  in  the  proceedings  of  IJCAI-93  An  approach  to  analyzing  the  need 
for  meta-level  communication,  and  AAAI-93  A  one-shot  dynamic  coordination  algorithm  for  distributed  sensor 
networks  .  This  work  was  supported  by  DARPA  contract  N00014-92-J-1698,  Office  of  Naval  Research  contract 
N00014-92-J-1450,  and  NSF  contract  CDA  8922572.  The  content  of  the  information  does  not  necessarily 
reflect  the  position  or  the  policy  of  the  Government  and  no  official  endorsement  should  be  inferred. 


1  Introduction 


Organizational  theorists  have  long  held  that  the  organization  of  a  set  of  agents  cannot  be  analyzed 
separately  from  the  agents’  task  environment,  that  there  is  no  single  best  organization  for  all  environments, 
and  that  different  organizations  are  not  equally  effective  in  a  given  environment  [8].  Most  of  these 
theorists  view  the  uncertainties  present  in  the  environment  as  a  key  characteristic,  though  they  differ 
in  the  mechanisms  that  link  environmental  uncertainty  to  effective  organization.  In  particular,  the 
transaction  cost  economics  approach  [11]  focuses  on  the  relative  efficiencies  of  various  organizations  given 
an  uncertain  environment,  while  the  modern  contingency  theory  approach  [14]  focuses  on  the  need  for  an 
organization  to  expand  toward  the  earliest  available  information  that  resolves  uncertainties  in  the  current 
environment. 

In  this  paper  we  use  both  of  these  concepts  to  analyze  potential  organizational  structures  for  a 
class  of  naturally  distributed,  homogeneous,  cooperative  problem  solving  environments  where  tasks 
arrive  at  multiple  locations,  exemplified  by  distributed  sensor  networks  [10],  Previous  approaches  to 
analyzing  organizations  in  distributed  sensor  networks  have  either  not  focused  on  the  effectiveness 
of  the  organization  [2,  12],  or  have  only  analyzed  organizational  effectiveness  in  particular,  single¬ 
instance  examples  [7].  Our  approach  is  to  model  the  task  environment  mathematically,  using  a 
formalism  developed  specifically  to  study  distributed  coordination  and  scheduling  [5] .  We  then  develop 
expressions  for  the  expected  efficiencies  of  static  and  dynamic  organizational  structures,  in  terms  of  the 
cost  of  communication  and  time  to  complete  a  given  set  of  tasks.  Finally,  we  validate  these  mathematical 
models  by  using  simulations. 

A  dynamic  organization  is  one  in  which  the  responsibilities  of  agents  can  be  reassigned  based  on  a 
developing  view  of  the  problem  at  hand.  Due  to  the  uncertainties  explicitly  represented  in  the  task  envi¬ 
ronment  model,  there  may  not  be  a  clear  performance  tradeoff  between  static  and  dynamic  organizational 
structures.  Agents  that  have  a  dynamic  organization  have  the  option  of  meta-level  communication — 
communicating  about  the  current  state  of  problem  solving  as  opposed  to  communicating  about  solving 
the  problem  itself  In  this  way,  information  that  resolves  uncertainties  about  the  current  environment 
becomes  available  to  the  agents,  allowing  the  agents  to  then  create  the  most  efficient  organization  for 
the  situation. 

Section  2  describes  the  task  environment  model,  the  assumptions  behind  it,  and  analyzes  the 
uncertainties  present.  Section  4  describes  static  and  dynamic  organizational  structures,  and  develops 
expressions  for  the  expected  performance  of  each  organizational  style.  This  section  also  describes  a 
particular  algorithm  for  generating  dynamic  structures,  and  discusses  the  actual  performance  of  the 
algorithm.  In  Section  5  we  then  show  how  the  variance  in  performance  without  communication  can 
lead  to  the  efficient  use  of  meta-level  communication  to  customize  a  dynamic  organizational  structure. 
Finally,  we  discuss  how  these  results  can  be  used  by  designers  of  distributed  problem  solvers,  and  how 
our  methodology  can  be  used  by  other  researchers.  Throughout  each  section,  we  will  illustrate  and 
confirm  the  analytical  results  experimentally,  using  as  an  example  a  simulated  distributed  sensor  network 
similar  to  the  Distributed  Vehicle  Monitoring  Testbed  (DVMT)  [10]. 


2  Task  Environment  Model 

Our  task  environment  model  of  naturally  distributed  problems  assumes  that  several  independent  groups 
of  tasks  arrive  at  multiple  locations  over  a  period  of  time  called  an  episode.  For  example,  in  a  distributed 
sensor  network  (DSN)  episode  the  movements  of  several  independent  vehicles  will  be  detected  over 
a  period  of  time  by  one  or  more  distinct  sensors,  where  each  sensor  is  associated  with  an  agent. 
The  performance  of  agents  in  such  an  environrnent  will  be  based  on  how  long  it  takes  them  to 
process  all  the  task  groups,  which  will  include  the  cost  of  communicating  data,  task  results,  and  meta¬ 
level  communication,  if  any.  The  organizational  structure  of  the  agents  will  imply  which  subsets  of 
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which  task  groups  are  available  to  which  agents  and  at  what  cost.  For  example,  if  DSN  agents  have 
overlapping  sensors,  either  agent  can  potentially  work  on  data  in  the  overlapping  area  (from  its  own 
sensor)  without  any  extra  communication  costs.  We  make  several  simplifying  assumptions:  that  the 
agents  are  homogeneous  (have  the  same  capabilities  with  respect  to  receiving  data,  communicating,  and 
processing  tasks),  that  the  agents  are  cooperative  (interested  in  maximizing  the  system  performance  over 
maximizing  their  individual  performance),  that  the  data  for  each  episode  is  available  simultaneously 
to  all  agents  as  specified  by  their  initial  organization,  and  that  there  are  only  structural  (precedence) 
constraints  within  the  subtasks  of  each  task  group.  ^ 

Any  single  episode  can  be  specified  by  listing  the  task  groups,  and  what  part  of  each  task  group  was 
available  to  which  agents,  given  the  organizational  structure.  Our  analysis  will  be  based  on  the  statistical 
properties  of  episodes  in  an  environment,  not  any  single  instance  of  an  episode.  The  properties  of  the 
episodes  in  a  DSN  environment  are  summarized  by  the  tuple  P  =<  A,  77,  r,  o,  T  >  where  A  specifies 
the  number  of  agents,  rf  the  expected  number  of  task  groups,  r  and  o  specify  the  structural  portion 
of  the  organization  by  the  range  of  each  agent  and  the  overlap  between  agents,  and  'T  specifies  the 
homogeneous  task  group  structure  (Section  2.5  and  Figure  4  describes  how  task  group  structures  are 
specified). 

Our  analysis  initially  focuses  on  exactly  what  a  priori  knowledge  agents  have  about  the  distribution 
of  task  groups  in  an  episode.  First  we  will  look  at  the  distribution  of  the  lowest-level  sensor  subtasks 
of  a  single  task  group  among  multiple  agents  (deriving  the  maximum  expected  number  of  subtasks), 
and  then  we  will  look  at  the  distribution  of  task  groups  themselves.  These  results  will  then  be  used  in 
subsequent  sections  to  derive  the  total  amount  of  work,  and  therefore  expected  termination  performance, 
under  various  organizational  structures  and  control  schemes.  Section  2.2  derives  the  expected  number 
of  low-level  sensor  subtasks  at  the  most  heavily-loaded  agent  given  the  number  of  task  groups  that  same 
agent  sees,  how  many  agents  see  a  single  task  group,  and  the  environmental  parameters.  Section  2.3 
then  derives  the  number  of  task  groups  at  the  most  heavily-loaded  agent  given  the  number  of  agents 
that  see  a  single  task  group  and  the  environmental  parameters.  Section  2.4  derives  the  number  of  agents 
that  will  see  a  single  task  group  given  the  environmental  parameters.  Finally,  Section  2.5  will  show  the 
structure  of  a  task  group,  which  allows  us  to  derive  the  amount  of  work  an  agent  must  do  given  the 
number  of  sensor  sub  casks  it  executes,  the  number  of  task  groups  at  the  agent,  and  the  number  of  agents 
involved  in  a  single  task  group.  All  of  these  parts  together  allow  us  to  derive  the  expected  termination 
time  of  a  particular  system  of  agents. 

2. 1  Task  environment  simulation 

In  the  next  section  and  for  the  rest  of  the  paper,  we  will  test  the  model  we  are  developing  against 
simulated  DSN  problems.  Each  simulated  DSN  episode  will  take  place  on  a  grid  where  the  concepts  of 
length  and  size  correspond  directly  to  physical  distances.  For  example,  Figure  1  illustrates  several  simple 
organizations  imposed  on  such  a  grid  in  our  simulation. 

In  the  simulation  we  assume  that  each  vehicle  is  sensed  at  discrete  integer  locations  (as  in  the  DVMX), 
randomly  entering  on  one  edge  and  leaving  on  any  other  edge.  Inbetween  the  vehicle  travels  along  a  track 
moving  either  horizontally,  vertically,  or  diagonally  each  time  unit  using  a  simple  DDA  line-drawing 
algorithm  (see  Figure  5).  In  an  18  X  18  grid,  the  (empirical)  average  length  of  a  track  is  14  units — the 
actual  length  of  any  one  track  will  range  from  2  to  19  units  and  is  not  distributed  normally.  Given  the 
organization  (r,  o,  and  A,  and  the  geometry),  we  can  calculate  what  locations  are  seen  by  the  sensors 
of  each  agent.  This  information  can  then  be  used  along  with  the  locations  traveled  by  each  vehicle  to 
determine  what  part  of  each  task  group  is  initially  available  to  each  agent.  Section  2.5  will  detail  what 
the  structure  of  each  task  group  is  for  the  DSN  simulation. 

^  In  general  there  are  usually  more  complex  interrelationships  between  subtasks  that  affect  scheduling  decisions,  such  as 
facilitation  [4] , 


2 


Figure  1:  Examples  of  18  x  18  DSN  organizations 


This  specification  is  applicable  to  other  interesting  environments.  For  example:  each  task  group  may 
comprise  several  different  types  of  subtasks;  each  agent  may  only  respond  to  a  certain  type  of  subtask 
(its  ‘range');  multiple  agents  may  respond  to  the  same  types  (overlap ).  In  general,  the  parameters  of 
‘range'  and  ‘overlap’  can  be  multidimensional, 

2.2  Expected  number  of  sensor  subtasks 

In  order  to  analyze  the  performance  of  a  particular  organization,  we  will  want  to  know  what  proportion 
of  each  task  group  each  agent  is  likely  to  process.  There  will  be  some  upper  limit  on  this  proportion 
(related  to  the  agent's  range  r),  and  sometimes  the  agent  will  process  less  than  this  upper  limit.  Especially 
in  static  organizational  structures  where  tasks  are  not  exchanged,  the  termination  of  the  system  as  a  whole 
can  be  tied  to  the  completion  of  all  tasks  at  the  most  heavily  loaded  agent.  Normally,  we  would  use  the 
average  part  of  a  task  group  to  be  seen,  but  since  the  focus  of  our  analysis  is  the  termination  of  problem 
solving,  we  need  to  examine  the  expected  maximum  portion  of  a  task  group  to  be  seen.  This  section 
will  develop  an  equation  for  the  expected  maximum  workload  at  an  agent  by  counting  the  expected 
number  of  low-level  sensor  subtasks  (each  individually  associated  with  a  sensed  vehicle  location)  that 
the  maximally  loaded  agent  will  have. 

The  amount  of  a  single  task  group  seen  by  an  agent  (which  is  the  same  as  the  number  of  sensor 
subtasks  in  the  DSN  example)  can  be  viewed  as  a  random  variable  S  with  a  probability  density  function 
and  corresponding  cumulative  distribution  function.  In  the  DSN  environment,  S  is  discrete,  and  its 
probability  function  (determined  empirically)  is  heavily  weighted  toward  v  (the  maximum).  To  simplify 
the  analysis,  instead  of  letting  S  correspond  to  the  number  of  subtasks  in  a  single  task  group  seen  by  an 
agent,  lets  create  a  new  random  variable  S  that  we  have  equal  1  if  the  agent  sees  the  maximum  amount, 
and  0  otherwise.  Now  S  has  a  Bernoulli  (coin-tossing)  distribution  with  parameter  p  corresponding 
to  the  chance  of  an  agent  seeing  the  maximum  amount  r  of  a  task  group.  Let's  assume  we  know  that 

^  ^  is  the  number  of  task  groups  at  the  maximally  loaded  agent,  and  that  on  average  a  <  i4  agents 
see  a  single  task  group  (we  11  remove  these  assumptions  later).  The  number  of  times  an  agent  sees  the 
maximum  out  of  N  task  groups  {N  coin  flips)  then  has  a  binomial  distribution  (6jv'^p(s)).  We  need  to 
know,  given  that  a  agents  each  flip  N  coins,  what  the  distribution  is  of  the  maximum  number  of ‘heads' 
any  agent  sees — this  is  called  the  binomial  max  order  statistic,  ga,N,p{s)'>^ 

*Ar,p(5)  =  [Pr[S  =  5]] 

BnA^)  =  H‘x=o^ff,pix)  [Pr[S<s]] 

9a,N,p{^)  =  -Bjv,p(s)“  -  Bn^p{s  -  1)“  [Pr  [S  =  «]] 

The  random  variable  S  referred  to  any  agent,  the  new  random  variable  S  refers  to  the  maximally 
loaded  agent.  Its  probability  function  q{^s)  has  a  much  steeper  shape  and  larger  expected  value  than  the 
binomial  6(s)  (try  it  and  see).  In  the  DSN  example  we  have  p  =  0.5®  and  the  amount  of  a  task  group 

A  more  detailed  derivation  of  this  result  is  in  our  tech  report. 

Empirically  determined  through  simulation. 
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seen  when  an  agent  does  not  see  the  maximum  amount  averages  r/2.  Now  we  can  convert  back  from 
the  easy  to  analyze  random  variable  S  to  the  variable  we  are  really  interested  in,  S,  by  the  equation 
5  =  (rS  +  {r/2)[N  —  S)).  The  expected  heaviest  load  seen  by  any  agent  when  a  agents  see  N  task 
groups  with  a  probability  p  of  seeing  r  and  probability  1  —  p  of  seeing  r/2  is: 

N 

a]  =  ^  (/a,iV,pW(r-s+  Un  -  s))  (1) 

5=0 

To  reiterate:  out  of  N  task  groups  the  maximally  loaded  agent  sees  the  maximum  r  some  S  (a  random 
variable)  times,  and  the  other  [N  —  S)  times  it  sees  only  r/2.  Eq.  1  shows  the  expected  value  of  a  new 
random  variable  S  that  indicates  the  number  of  sensor  sub  tasks  at  the  maximally  loaded  agent  when 
there  are  N  task  groups  in  an  episode.  N  is  itself  a  random  variable,  we’ll  look  at  its  distribution  in  the 
next  section. 

Figure  2  shows  the  heaviest  load  actually  observed  and  averaged  over  1000  runs,  plotted  against  the 
expected  value,  for  n  tracks  and  all  square  DSN  organizations  [2  <  r  <  10,0  <  o  <  r,  1  <  y/A  <  10, 1  < 
N  <  10]  {B}  >  .98)^.  The  colors  of  the  points  refer  to  the  value  of  r;  lighter  grey  corresponds  to  larger 
values  of  r. 


Figure  2:  Actual  versus  predicted  heaviest  load  far;  various  values  of  Ay  r,  o,  and  N 


2.3  Expected  Number  of  Task  Groups 

Given  the  maximum  number  of  task  groups  seen  by  an  agent  (AT),  we  can  calculate  the  expected  heaviest 
agent  load  using  Equation  1.  But  this  begs  the  question  of  what  is  the  maximum  number  of  task  groups 
an  individual  agent  will  see,  given  the  actual  (n)  or  expected  number  {t})  that  the  entire  system  will 
see.  The  solution  is  similar — each  agent  either  sees  or  does  not  see  each  of  the  n  task  groups,  another 
binomial  process.  Let  Ni  be  the  number  of  task  groups  sensed  by  agent  z,  with  a  binomial  distribution 
of  parameters  n  and  q.  If  a  is  again  the  number  of  agents  that  see  a  single  task  group  and  A  the  total 
number  of  agents,  then  q  —  af  A,  the  probability  that  each  agent  will  see  a  particular  track  (we’ll  give 
an  equation  for  a  next).  By  the  same  derivation  as  in  the  last  section,  the  max  order  statistic  N  has  the 
probability  function  9A,n,a/A{^)y  expected  value: 

n 

E[iV|n,a]  =  (2) 

5=0 

is  the  squared  correlation  coefficient,  a  measure  of  goodness  of  fit.  It  may  be  interpreted  as  the  proportion  of  total 
variabity  in  the  observed  data  that  is  explained  by  the  model. 
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Figure  3  shows  the  actual  mean  value  of  the  maximum  number  of  tracks  seen  by  an  agent  over  1000 
runs  of  the  DSN  simulation,  versus  the  predicted  value,  for  n  tracks  and  square  DSN  organizations 
[2  <  r  <  10, 1  <  \fA  <  10, 1  <  n  <  10]  without  any  overlap  (jO'  >  .96) 


Predicted  Maximum  Number  of  Tracks  Predicted  number  of  agents  to  see  a  track 

Figure  3:  On  the  left,  actual  versus  predicted  maximum  number  of  task  groups  (tracks)  seen  by  any  one  agent  for 
various  r,  Ay  and  n.  On  the  right,  actual  versus  predicted  average  number  of  agents  seeing  a  single  task  group 
(track)  for  various  r,  o,  and  A. 


2.4  Expected  Number  of  Agents 

The  only  remaining  term  we  need  to  analyze  before  deriving  an  expression  for  system  performance 
is  a,  the  expected  number  of  agents  that  will  see  a  single  task  group.  In  general,  a  will  depend  on 
the  total  number  of  agents  A  and  the  organization  (r  and  o).  When  there  is  only  one  agent,  it  will 
see  every  task  group  {a  —  1).  When  the  agents  overlap  completely,  every  agent  sees  every  task  group 
([o  =  r]  [a  ==  A]).  When  the  agents  in  a  square  environment  do  not  overlap,  a  =  ^/A.  The 
relationship  follows  the  ratio  of  the  area  solely  covered  by  an  agent  plus  the  area  of  the  overlapping 
section,  to  the  total  area  covered  alone: 

a  =  (3) 

Note  that  a  is  not  a  random  variable,  it  is  just  derived  directly  from  environmental  parameters.  Figure  3 
shows  a  regression  of  the  actual  average  value  of  a  over  1000  runs  verses  the  predicted  value  for  all  630 
DSN  organizations  [2  <  r  <  10,0  <  o  <  r,  1  <  \/A  <  10]  {B?  >  .98). 

2.5  Work  Involved  in  a  Task  Structure 

Finally  we  turn  to  modeling  the  performance  of  the  system  as  a  whole,  which  is  based  on  the  structure  of 
the  tasks  involved.  We  have  developed  a  characterization  of  task  environments  that  formally  captures  the 
range  of  features,  processes,  and  especially  interrelationships  that  occur  during  computationally  intensive 
coordination  and  scheduling  [5].  The  model  of  environmental  and  task  characteristics  we  propose  has 
three  levels:  objectivcy  subjectivcy  and  generative^  the  subjective  level  is  not  discussed  here. 

The  objective  level  describes  the  essential  structure  of  a  particular  problem-solving  situation  or 
instance  over  time.  It  focuses  on  how  task  interrelationships  dynamically  affect  the  quality  and  duration 
of  each  task.  In  this  paper  we  will  concentrate  only  on  duration  as  a  performance  metric.  The  basic 
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model  is  that  task  groups  T  occur  in  the  environment  at  some  frequency,  and  induce  tasks  T  to  be 
executed  by  the  agents  under  study.  Task  groups  are  independent  of  one  another,  but  tasks  within  a 
single  task  group  have  interrelationships.  An  individual  task  that  has  no  subtasks  is  called  a  method  M 
and  is  the  smallest  schedulable  chunk  of  work.  The  quality  and  duration  of  an  agents  performance  on 
an  individual  task  is  a  function  of  the  timing  and  choice  of  agent  actions  (‘local  effects'),  and  possibly 
previous  task  executions  (non-local  effects').  When  local  or  non-local  effects  exist  between  tasks  that  are 
known  by  more  than  one  agent,  they  become  coordination  relationships  [4,  3].  The  basic  purpose  of  the 
objective  model  is  to  formally  specify  how  the  execution  and  timing  of  tasks  affect  quality  and  duration. 

At  the  lowest  level,  each  method  (leaf  task)  M  at  time  t  can  produce,  if  executed,  some  maximum 
quality  q(M,  t)  in  some  amount  of  time  d(M,  i)  (each  method  has  an  initial  maximum  quality  and 
duration  qo(M)  and  do(M)).  Q{Mj  t)  will  denote  the  quality  of  method  M  at  time  t;  Q(M,  i)  =  0 
before  a  method  is  executed,  and  in  the  simplest  case  Q(M,  t)  =  q(M,  t)  =  qo(M)  after  the  method  is 
executed.  In  this  paper,  we  will  be  concerned  primarily  with  method  durations,  and  the  choice  of  method 
quality  accrual  function  Q{Mj  t)  is  not  significant.  Any  task  execution  that  starts  before  the  execution 
of  T  completes  may  potentially  affect  T’s  execution  through  non-local  ejfects.  There  are  precisely  two 
possible  non-local  effects  on  T  under  our  model:  duration  effects,  where  d(r,  t)  (duration)  is  changed 
for  some  or  all  of  the  methods  for  a  task;  and  quality  effects,  where  q(r,  t)  (quality)  is  changed  similarly. 
A  non-local  effect  on  a  task  is  initiated  by  the  execution  of  some  other  task  in  the  task  group.  The  effect 
is  dependent  on  the  relative  timing  of  the  two  task  executions,  the  quality  of  the  task  causing  the  effect, 
and  whether  information  was  transmitted  between  the  two  tasks. 

Other  work  has  considered  the  case  of facilitation,  a  non-local  effect  where  the  availability  of  a  result 
from  one  task  alters  the  quality  and  duration  of  another  task,  but  the  non-communication  of  a  result  has 
no  effect  [4].  This  work  considers  a  different  relationship,  precedence.  If  task  A  precedes  task  B,  then 
the  maximum  quality  q(B,  f)  =  0  until  A  is  completed  and  the  result  is  available,  when  the  maximum 
quality  will  change  to  the  initial  maximum  quality  q(5,  t)  =  qo(5). 

2.5.1  Execution  Model 

For  this  paper  we  use  an  extremely  simple  model  of  execution.  Agents  can  perform  three  actions:  method 
execution,  communication,  and  information  gathering.  The  control  component  of  an  agent  determines 
the  next  action  an  agent  will  perform  based  on  the  agent’s  current  set  of  beliefs  [1,  13].  A  method 
execution  action,  of  method  M,  that  is  begun  at  time  t  will  coriclude  at  time  t+d{M,  ^).  An  information 
gathering  action  has  duration  do(/)  and  updates  the  agent's  set  of  beliefs  with  any  new  information  in 
the  environment,  for  example,  the  arrival  of  data  at  the  start  of  an  episode,  or  communications  from 
other  agents.  A  communication  action  has  duration  do(C')  and,  after  a  communication  delay,  makes 
information  (such  method  execution  results)  available  to  other  agents.  The  agent  on  the  receiving  side 
must  perform  an  information  gathering  action  before  the  communication  can  affect  its  local  beliefs. 

2.5.2  Simple  Objective  DSN  Model 

Recall  that  the  summary  of  a  DSN  environment  was  the  tuple  V  — <  A,77,r,  o,  T  >;  this  will  become 
our  generative  model,  especially  the  parameter  7j  (expected  number  of  task  groups).  A  particular  episode 
in  this  environment  can  be  described  by  the  tuple  D  =<  A,  r,  o,  7i, . .  • ,  7^  >  where  n  is  a  random 
variable  drawn  from  an  unknown  distribution  with  location  parameter  (central  tendency)  of  77.  Note 
that  we  make  almost  no  assumptions  about  this  distribution;  its  characteristics  will  differ  for  different 
environments.  For  example,  in  the  description  of  our  DSN  simulation  early  in  Section  2  we  noted 
the  physical  process  by  which  vehicle  tracks  were  generated  and  that  the  length  of  the  tracks  was  not 
normally  distributed. 

Each  task  group  Ti  is  associated  with  a  track  of  length  and  has  the  same  basic  objective  structure, 
based  on  the  DVMT: 
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•  li  Vehicle  Location  Methods  (VLM  s)  that  represent  processing  raw  signal  data  at  a  single  location 
to  a  single  vehicle  location  hypothesis. 

•  li  —  1  Vehicle  Tracking  Methods  (VTM  s)  that  represent  short  tracks  connecting  the  results  of  the 
VLM  at  time  t  with  the  results  of  the  VLM  at  time  i  +  1. 

•  1  Vehicle  Track  Completion  Method  (VCM)  that  represents  merging  all  the  VTM  s  together  into 
a  complete  vehicle  track  hypothesis. 

Non-local  precedence  effects  exist  between  each  method  at  one  level  and  the  appropriate  method  at  the 
next  level  as  shown  in  Figure  4 — two  VLMs  precede  each  VTM,  and  all  VTM’S  precede  the  lone  VCM. 


Figure  4:  Objective  task  structure  associated  with  a  single  vehicle  track. 

If  we  assume  that  each  VLM  has  initial  duration  do  (VLM)  and  each  VTM  has  the  initial  duration 
do  (VTM),  then  we  can  see  from  the  task  structure  that  for  each  task  group  the  total  execution  time 
taken  by  a  single  processor  agent  will  be: 

Zido(VLM)  +  {k  -  l)do(VTM)  +  do(VCM)  (4) 

This  task  structure  is  a  simplification  of  the  real  DVMT  task  structure.  For  example,  there  is  no 
sensor  noise  (which  will  cause  facilitation  relationships  between  tasks,  and  there  is  no  confusion  caused 
by  ghost  tracks’.  Adding  these  features  to  the  task  structure  will  cause  some  interesting  phenomena  that 
we  will  discuss  briefly  in  the  conclusions. 

3  Model  Summary 

This  section  has  described  our  basic  model  of  the  DSN  environment,  beginning  with  the  basic  pa¬ 
rameters  V  =<  A,Ti,r,o,T  > .  A  particular  episode  in  this  environment  can  be  indicated  as 
D  =<  AjT*,  o,  >,  where  each  of  the  n  task  groups  has  the  structure  mentioned  above. 

This  section  also  developed  expressions  for  5,  the  number  of  low-level  sensor  subtasks  (i.e.,  VLMs)  at 
the  most  heavily  loaded  agent,  the  number  of  task  groups  N  at  the  most  heavily  loaded  agent,  and  the 
average  number  of  agents  a  that  see  a  single  task  group.  Another  way  to  look  at  these  results  is  that  we 
have  derived  the  probability  distributions  of  these  variables,  i.e.,  if  the  system  of  agents  as  a  whole  sees 
n  total  task  groups,  then  the  distributions  of  N  and  S  are: 

VT[N  =  N\n]  =  (5) 
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Pt[S^s\N  =  N]  =  9a,Nfi.5{s)  (6) 

5  =  (rS  +  (r/2)(i\r-S))  (7) 

Finally  Eq.  4  derived  the  duration,  or  amount  of  processing  work,  involved  in  a  single  task  group  from 
its  structure. 

In  the  next  section  we  will  combine  these  results  with  simple  local  scheduling  and  coordination 
algorithms  appropriate  for  static  and  dynamic  organizations.  This  will  allow  us  to  describe  the  perfor¬ 
mance  of  a  system  of  agents  following  one  of  these  organizational  algorithms  in  a  particular  episode  or 
environment. 

4  Static  vs.  Dynamic  Organizational  Structures 

Now  we  have  the  necessary  background  to  analyze  static  and  dynamic  organizational  structures.  We 
have  equations  for  the  maximum  expected  number  of  subtasks  at  an  agent  given  the  number  of  task 
groups  seen  S ,  the  maximum  expected  number  of  task  groups  seen  given  the  total  number  Ny  and  the 
predicted  number  of  agents  sensing  part  of  a  task  group  a.  The  key  to  static  structures  is  to  divide  up 
the  overlap  area  a  pnori  (rather  than  to  penalize  agents  for  doing  redundant  work  in  the  overlap  area 
[7]).  The  key  to  dynamic  organizational  structures  is  to  transfer  tasks  so  that  all  the  agents  resources  are 
used  efficiently. 

We  will  repeat  the  assumptions  we  discussed  at  the  start  of  Section  2  on  page  1:  the  agents  are 
homogeneous  (have  the  same  capabilities  with  respect  to  receiving  data,  communicating,  and  processing 
tasks),  the  agents  are  cooperative  (interested  in  maximizing  the  system  performance  over  maximizing 
their  individual  performance),  the  data  for  each  episode  is  available  simultaneously  to  all  agents  as 
specified  by  their  initial  organization,  and  there  are  only  structural  (precedence)  constraints  within  the 
subtasks  of  each  task  group. 

4. 1  Analyzing  Static  Organizations 

In  a  static  organization,  agents  divide  the  overlapping  areas  of  their  ranges  as  evenly  as  possible.  The 
result  is  a  new  area  of  responsibility  r'  =  r  -  |  for  each  agent  with  no  overlap  (see  Figure  5).^  Given 
the  task  structure  as  described  in  Section  2.5  and  shown  in  Figure  4,  and  any  raw  data  or  communicated 
task  results  provided  by  information  gathering  actions,  the  agent  can  at  any  time  build  a  list  of  currently 
executable  methods  (under  the  set  of  precedence  constraints).  Also,  at  any  time  an  agent  can  build  a  list 
of  methods  that  need  to  be  executed,  but  cannot  be  because  their  precedence  constraints  have  not  yet 
been  met.  The  communication  action  in  this  algorithm  is  a  broadcast  of  the  highest  level  results  of  all 
the  task  groups  an  agent  has  worked  on.  Each  agent  follows  the  same  control  algorithm  (remember,  all 
the  raw  data  is  available  at  the  start)  and  terminates  when  all  task  groups  are  completed  (either  locally 
or  by  reception  of  the  result  from  another  agent): 


The  reason  for  overlap  v/ill  be  apparent  in  dynamic  structures — multiple  agents  can  work  in  an  overlapping  area  without 
paying  any  cost  for  communicating  raw  data  between  them.  Overlap  can  also  provide  redundancy  in  case  of  agent  failure. 


8 


(Repeat 

Do  Information-Gathering-Action 
(Repeat 

Let  E  =  [get  set  of  currently  executable  methods] 

(For  method  In  E 

Do  Method- Execution-Action  (method)) 

Until  (null  E)) 

Do  Communication-Action  (broadcast  highest-level  results) 

Let  W  =  [get  set  of  methods  still  waiting  on  precedence  constraint^ 
Until  (null  W)) 


Figure  5:  Example  of  a  3x3  organization,  r  =  11,  o  =  5,  with  5  tracks.  The  thick  dark  grey  boxes  outline  the 
default  static  organization,  where  there  is  no  overlap. 

First,  let  us  analyze  this  algorithm  assuming  that  only  one  task  group  (vehicle  track)  is  present.  In 
the  environment  V  —<  A,  7?,  r,  o,  T  >,  if  we  let  5'  represent  the  largest  amount  of  low-level  data 
in  one  task  group  seen  by  any  agent,  and  a  the  total  number  of  agents  that  see  the  task  group  (from 
Equation  3),  then  the  amount  of  time  it  will  take  that  agent  to  construct  a  complete  solution  is  equal  to 
the  amount  of  time  it  will  take  for  the  initial  information  gathering  action  (do(/))  plus  the  amount  of 
time  to  do  all  the  local  work  (5Mo(VLM)  +  {S^  —  l)do(VTM)),  communicate  that  work  (do(C)), 
get  the  other  agents  results  (do(/)),  plus  the  amount  of  time  to  combine  results  from  the  other  a  —  1 
agents  ((a  —  l)do(VTM)),  plus  time  to  produce  the  final  complete  task  group  result  (do(VCM)),  plus 
communicate  that  result  to  everyone  (do(C)).  For  simplicity  we  will  assume  that  do(/)  and  do(C)  are 
constant  and  do  not  depend  on  the  amount  of  data.  Note  also  that  since  this  agent  is  the  most  heavily 
loaded,  by  definition  all  other  agents  will  finish  their  work  and  have  communicated  it  by  the  time  this 
agent  finishes  its  local  work. 

In  the  general  case,  if  the  system  sees  n  =  77  total  task  groups,  then  the  expected  amount  of  low-level 
sensor  data  (size  of  the  initial  data  set)  at  the  maximally  loaded  agent  can  be  derived  from  the  marginal 
expected  value  for  S  given  the  joint  distribution  of  S  (Eqns.  6,  7)  and  N  (Eq.  5): 

n  N 

Y.9A,n,^iN)ga,N.pi^){rs  +  Un  -  s))  (8) 

N=0  5-0  ^ 

Similar  to  the  single  task  group  case,  the  total  time  until  termination  for  an  agent  receiving  an  initial  data 
set  of  size  S  is  the  time  to  do  local  work,  combine  results  from  other  agents,  and  build  the  completed 
results,  plus  two  communication  and  information  gathering  actions: 

5do(VLM)  -f  (S  -  7V)do(VTM)  -f  (a  -  l)iVdo(VTM)  +  iVdo(VCM)  -f  2do(/)  +  2do(C)  (9) 
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We  can  use  Eq.  9  as  a  predictor  by  combining  it  with  the  probabilities  for  the  values  of  S  and  N  given 
in  Eqns.  6,  7,  and  5  (this  is  very  similar  to  the  treatment  in  Eq.  8). 

We  tested  these  predictions  of  Equation  9  versus  the  mean  termination  time  of  our  DSN  simulation 
over  10  repetitions  in  each  of  43  randomly  chosen  environments  from  the  design  space  [2  <  r  <  10, 0  < 
o  <r,l  <  <  5, 1  <  TV  <  10].  The  durations  of  all  tasks  were  set  at  1  time  unit,  as  were  the  duration 

of  information  gathering  and  communication  actions;  we  will  demonstrate  and  discuss  the  effect  of  this 
assumption  later  in  the  paper.  We  used  the  simulation  validation  statistic  suggested  by  Kleijnen  [9] 
(where  y  =  the  predicted  output  by  the  analytical  model  and  y  =  the  output  of  the  simulation): 


(Var(y)  +  Var(2/))V2 


(10) 


where  Var(y)  is  the  predicted  variance.®  The  result  z  can  then  be  tested  for  significance  against  the 
standard  normal  tables.  In  each  case,  we  were  unable  to  reject  the  null  hypothesis  that  the  actual  mean 
termination  equals  the  predicted  mean  termination  at  the  a  =  0.05  level.  For  non-statisticians:  this 
is  a  good  thing.  The  null  hypothesis  is  that  our  prediction  is  the  same  as  the  actual  value,  we  did  not 
wish  to  reject  it,  and  we  did  not.  However,  such  a  test  has  problems  since  there  are  other  possible 
reasons  which  might  prevent  us  from  rejecting  the  null  hypothesis.  The  best  approach  to  avoiding  this 
problem  is  to  report  not  only  the  alpha  level  of  the  test  but  also  the  tests  power.  With  any  statistical 
hypothesis  test  there  are  four  possible  outcomes:  we  are  unable  to  reject  the  null  hypothesis  when  it  is 
in  reality  true  ,  we  are  unable  to  reject  the  null  hypothesis  when  it  is  in  reality  false  (called  a  Type  II 
error),  we  reject  the  null  hypothesis  when  it  is  in  reality  false,  and  we  reject  the  null  hypothesis  when 
it  is  in  reality  true  (called  a  Type  I  error).  The  a  level  is  essentially  the  probability  of  a  Type  I  error. 
The  power  of  a  test  is  (1  —  /?),  where  /3  is  essentially  the  probability  of  a  Type  II  error  (accepting  a  false 
null  hypothesis).  Unfortunately,  estimating  the  power  of  a  test  is  often  difficult,  and  requires  detailed 
knowledge  of  the  distribution  of  the  test  statistic  under  the  condition  that  the  alternative  hypothesis  is 
true  (this  is  turn  usually  requires  the  computation  of  the  power  for  several  likely  alternative  hypotheses). 
Future  work  will  involve  the  computation  of  the  power  for  this  statistic,  or  the  development  of  a  new  test 
based  on  other  possible  statistics  that  have  better- understood  power  characteristics.  Figure  6  shows  the 
mean  of  10  repetitions  in  each  environment  versus  the  expected  value  and  its  likelihood  intervals.  Thus 
the  analytical  model  describes  the  implementation  fairly  well,  and  we  could  use  the  analytical  model 
to  design  a  good  static  organization  for  a  given  environment,  using  standard  heuristic  optimization 
techniques  such  as  simulated  annealing. 


4.2  Control  Costs 

The  control  algorithm  presented  above  is  simple  and  not  necessarily  optimal.  By  communicating 
only  when  there  is  no  local  work  to  be  done,  the  heaviest-loaded  agent  gives  up  the  chance  for 
other  agents  to  do  the  high-level  composition  in  a  task  group  by  incrementally  transmitting  each 
result  (or  set  of  results  on  a  single  task  group)  as  it  is  completed — a  maximum  potential  savings  of 
~~  1)(®  “  l)do(VTM)  -f-  [N  —  l)do(VCM).  However,  this  needs  to  be  balanced  with  the  cost  of 
multiple  communication  actions,  which  is  {N  —  l)do(C).  Thus  the  question  oTwhen  to  communicate^ 
(when  to  incrementally  transmit  partial  results)  rests  directly  on  the  cost  of  communication  relative  to 
(a  —  l)do(VTM)  +  do(VCM)  (which  depends  on  both  the  basic  method  durations  and  the  agents’ 
organizational  structure). 

This  simple  control  algorithm  can  be  analyzed  easily,  unlike  many  other  systems  where  control  costs 
are  ignored.  If  we  view  the  cost  of  control  as  the  time  spent  by  an  agent  when  performing  an  action 

®The  predicted  variance  of  Equation  9  can  be  easily  derived  from  the  statistical  identity  Var( a:)  =  —  (-^[^j])^- 
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Figure  6:  Actual  system  termination  versus  analytic  expected  value  and  analytically  determined  50%  and  90% 
likelihood  intervals.  Runs  arbitrarily  ordered  by  expected  termination  time. 


(executing  a  method,  information  gathering,  communication),  then  our  algorithm  runs  in  constant  time 
between  actions  except  for  the  two  tests  [get  set  of  currently  executable  method^  and  [get  set  of  methods  still 
waiting.  Each  of  these  in  the  worst  case  requires  a  constant-time  test  of  each  element  of  the  full  task 
structure,  which  is  of  size  Oijjr).  Thus  we  see  how  the  control  costs,  too,  are  related  to  organizational 
structure. 

4.3  Analyzing  Dynamic  Organizations 

In  the  dynamic  organizational  case,  agents  are  not  limited  to  the  original  organization  and  initial  distri¬ 
bution  of  data.  Agents  can  re-organize  by  changing  the  initial  static  boundaries  (changing  responsibilities 
in  the  overlapping  areas),  or  by  shipping  raw  data  to  other  agents  for  processing  (load  balancing).  We 
will  assume  in  this  section  that  the  agents  do  not  communicate  about  the  current  local  state  of  problem 
solving  directly  (see  Section  5  on  using  meta-level  communication).  A  clearer  distinction  is  that  in  the 
dynamic  organization  each  agent  makes  its  initial  decision  (about  changing  boundaries  or  shipping  raw 
data)  without  access  to  non-local  information.  In  the  meta-level  communication  situation  discussed 
later  the  agent  has  access  to  both  its  local  information  and  a  summary  of  the  local  state  of  other  agents. 
In  either  case  the  decision  to  dynamically  change  the  organization  is  made  only  once,  at  the  start  of  an 
episode. 

In  the  case  of  reorganized  overlapping  areas,  agents  may  shift  the  initial  static  boundaries  by  sending 
a  (very  short)  message  to  overlapping  agents,  telling  the  other  agents  to  do  all  the  work  in  the  overlapping 
areas.  The  effect  at  the  local  agent  is  to  change  its  effective  range  parameter  from  its  static  value  of 
r'  in  r  -  o/2  to  some  value  r"  where  r  -  o/2  >  r"  >  r  -  o,  changing  the  first  two  terms  of  Equation  9, 
and  adding  a  communication  action  to  indicate  the  shift  and  an  extra  information  gathering  action 
to  receive  the  results.  Section  4.4  discusses  a  particular  implementation  of  this  idea  that  chooses  the 
partition  of  the  overlapping  area  that  best  reduces  expected  differences  between  agents  loads  and  averages 
competing  desired  partitions  from  multiple  agents. 

In  the  second  case,  an  agent  communicates  some  proportion  p  of  its  initial  data  to  a  second  agent, 
who  does  the  associated  work  and  communicates  the  results  back.  Instead  of  altering  the  effective  range 
and  overlap,  this  method  directly  reduces  the  first  two  terms  of  Equation  9  by  the  proportion  p.  The 
proportion  p  can  be  chosen  dynamically  in  a  way  similar  to  that  of  choosing  where  to  partition  the 
overlap  between  agents  (Section  4.4). 

Whether  or  not  a  dynamic  reorganization  is  useful  is  a  function  of  both  the  agents  local  utility  and 
also  the  load  at  the  other  agent.  We  will  again  be  concentrating  on  the  agent  with  the  heaviest  load. 
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Looking  first  at  the  local  utility,  to  do  local  work  under  the  initial  static  organization  with  n  task  groups, 
the  heaviest  loaded  agent  will  take  time: 

5'do(VLM)  +  {S'  -  n)do(VTM)  (11) 

When  the  static  boundary  is  shifted  before  any  processing  is  done,  the  agent  will  take  time 

do(Cshort)  +  5"do(VLM)  +  (5"  --  n)do(VTM)  +  do(/)  (12) 

to  do  the  same  work,  where  Cghort  is  a  very  short  communication  action  which  is  potentially  much 
cheaper  than  the  result  communications  mentioned  previously,  and  S"  is  calculated  using  When 
balancing  the  load  directly,  local  actions  will  take  time 

do(Ciong)  +  pS'doiVLM)  +  p{S'  -  n)do(VTM)  +  do(/)  (13) 

where  do(Cioiig)  is  potentially  much  more  expensive  than  the  communication  actions  mentioned  earlier 
(since  it  involves  sending  a  large  amount  of  raw  data).  If  the  other  agent  had  no  work  to  do,  a  simple 
comparison  between  these  three  equations  would  be  a  sufficient  design  rule  for  deciding  between  static 
and  either  dynamic  organization. 

Of  course,  we  cannot  assume  that  the  other  agent  is  not  busy;  the  best  we  can  do  a  priori  (without 
meta-level  communication  during  a  particular  episode)  is  to  assume  the  other  agent  has  the  average 
amount  of  work  to  do.  We  can  derive  a  priori  estimates  for  the  average  initial  work  at  another  agent 
from  Equation  9  by  replacing  the  probability  function  of  the  max  order  statistic  ^a.iV.pC^)  with  the 
simple  binomial  probability  function  6jv',p(5).  Therefore  without  any  meta-level  communication,  a 
system  of  agents  could  choose  intelligently  between  static,  dynamic  overlap  reorganization,  and  dynamic 
load  balancing  given  these  constraints. 

4.4  Dynamic  Coordination  Algorithm  for  Reorganization 

This  section  describes  a  particular  implementation  of  the  general  idea  described  earlier  of  dynamically 
reorganizing  the  partitions  between  agents  for  the  DSN  simulation.  This  implementation  will  keep  each 
agents  area  of  responsibility  rectangular,  and  relaxes  competing  constraints  from  other  agents  quickly 
and  associatively  (the  order  of  message  arrival  does  not  affect  the  eventual  outcome).  To  do  this,  the 
message  sent  by  an  agent  requests  the  movement  of  the  four  corridors  surrounding  an  agent.  The 
northern  corridor  of  Agent  1 ,  for  example,  is  the  northern  agent  organizational  responsibility  boundary 
shared  by  every  agent  in  the  same  row  as  Agent  1.  As  can  be  seen  in  Figure  7,  a  3x3  organization  has 
four  corridors  (between  rows  1  and  2,  2  and  3,  and  between  columns  1  and  2,  2  and  3). 

The  coordination  algorithm  described  here  works  with  the  local  scheduling  algorithm  described 
earlier  in  Section  4. 1 .  This  is  consistent  with  our  view  of  coordination  as  a  behavior  [4],  The 

only  modification  to  the  local  scheduler  is  that  we  prevent  it  from  scheduling  local  method  execution 
actions  until  our  initial  communications  are  completed  (the  initial  and  reception  phases,  described 
below). 

The  coordination  algorithm  is  then  as  follows.  During  the  initial  phase  the  local  scheduler  schedules 
the  initial  information  gathering  action,  and  we  proceed  to  the  second  phase,  reception.  In  the  second 
phase  we  use  the  local  information  to  decide  what  organizational  design  to  use,  and  the  parameter  values 
for  the  design  we  choose.  To  do  this  we  calculate  the  duration  of  our  (known)  local  work  (Eq.  11), 
and  then  estimate  that  duration  under  the  alternative  organizations  (dynamic  reorganization  or  load¬ 
balancing).  When  a  parameter  needs  to  be  estimated,  we  do  so  to  minimize  the  absolute  expected 
difference  between  the  amount  of  work  to  be  done  locally  and  the  amount  of  work  done  at  the  remote 
agent  that  is  impacted  the  most  by  the  proposed  change. 
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Figure  7:  On  the  left  is  a  3x3  static  organization,  on  the  right  is  the  dynamic  reorganization  result  after  agents  3, 
4,  5  and  7  attempt  to  reduce  their  areas  of  responsibility  by  one  unit.  In  this  example  the  corridors  running  North 
to  South  have  been  moved  closer  by  two  units  to  reduce  the  load  on  agents  4,  5,  and  6  in  the  second  column. 

For  example,  when  dynamically  restructuring,  if  the  overlap  between  agents  is  more  than  2  units, 
we  have  a  choice  of  reducing  the  area  an  agent  is  responsible  for  by  more  than  1  unit  (this  is  the 
organizational  design  parameter  p  in  question).  To  decide  on  the  proper  reduction  (if  any),  each  agent 
computes  its  known  local  work  T^(p)  using  Eq.  1 1  with  the  actual  (not  estimated)  5'  and  N  computed 
assuming  the  agents  area  is  reduced  by  p.  Then  the  agent  finds  the  value  of  p  that  minimizes  the 
difference  in  its  known  local  work  W{r  —  p)  and  the  average  work  W{r  +  p)  at  the  other  agent: 

S{r,s,N)  =  (J’s+ -  s)) 

W{r,s,N)  =  S(7-,5,iV)do(VLM)  +  (S(7',s,iV)-iV)do(VTM) 

n  N 

E[W{r)]  =  YlJ2gA,n,-^iN)ga,N,p{s)W{r,s,N) 

N=0  s-0 
n  N 

N=0  s=0 

Equation  15  is  just  a  restatement  of  Eqn.  14  (itself  derived  from  Eqns.  5,  6,  7,  and  1 1  for  the  case  of  the 
average,  not  maximally  loaded,  agent  (thus  the  use  of  6,  the  binomial  probability  function  rather  than 
g,  the  max  order  statistic  p.f.). 

If  p  =  0,  then  the  agent  will  not  restructure.  If  p  0,  then  the  agent  sends  a  message  to  all  affected 
agents  requesting  a  reduction  of  amount  p  in  each  corridor  (north,  east,  south,  and  west).  The  agent 
sets  its  current  area  of  interest  to  include  only  the  unique  (non-overlapping)  portion  of  its  area  (if  any), 
and  enters  the  unique-processing  phase.  During  this  phase  the  regular  local  scheduler  described  earlier 
controls  method  execution  actions. 

When  no  more  methods  unique  to  this  agent  can  be  executed,  the  coordination  algorithm  checks 
the  current  time.  If  enough  time  has  passed  for  the  messages  from  other  agents  (if  any)  to  arrive 
(this  depends  on  the  communication  delays  in  the  system),  the  coordination  algorithm  schedules  an 
information-gathering  action  to  retrieve  the  messages.  Note  that  every  agent  may  reach  this  point  at  a 
different  time;  agents  with  a  large  amount  of  unique  local  work  may  take  some  time,  agents  with  no 
work  at  all  will  wait  idle  for  the  length  of  communication  delay  time  in  the  system. 


(14) 

(15) 
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At  this  point  each  agent  will  relax  its  borders  according  to  the  wishes  of  the  other  agents.  The 
relaxation  algorithm  we  have  chosen  is  fairly  simple  and  straightforward,  though  several  similar  choices 
are  possible.  The  algorithm  is  symmetric  with  respect  to  the  four  corridors  surrounding  the  agent, 
so  we  will  just  discuss  the  relaxation  of  the  northern  corridor.  There  will  be  a  set  of  messages  about 
that  corridor,  some  wanting  it  moved  up  by  some  amount  and  some  wanting  it  moved  down  by  some 
amount — we  will  consider  these  as  positive  and  negative  votes  of  some  magnitude.  The  relaxation 
algorithm  sums  the  votes,  and  returns  the  sum  unless  it  is  larger  than  the  maximum  vote  or  smaller 
than  the  minimum  vote,  in  which  case  the  max  or  min  is  returned,  respectively.  At  this  point  the  agent 
enters  the  final  normal  processing  phase,  and  the  local  scheduler  schedules  all  further  actions  as  described 
earlier, 

4.5  Analyzing  the  Dynamic  Restructuring  Algorithm 

As  we  did  in  Section  4.1,  we  can  develop  an  expression  for  the  termination  time  of  any  episode  where 
the  agents  follow  this  algorithm.  To  do  so,  we  start  with  the  basic  termination  time  given  all  of  the 
random  variables.  This  equation  is  derived  from  Eqns.  14  and  15: 

r(r,  5,  iV,  s,  N)  =  max[T^(r  -  p,  5,  N),  W{v  +  p,  s,  N)]  (16) 

where  p  is  computed  as  described  in  the  last  section  using  the  given  values  of  (r,  5,  N,  a,  N).  To  turn  this 

into  a  predictive  formula,  we  then  use  the  expressions  for  the  probabilities  of  the  terms  5,  iV,  5,  and  N 
(from  Eqns.  6,  7,  and  5).  For  example,  we  can  produce  an  expression  for  the  expected  termination  of 
the  algorithm: 

n  N  n  R 

E  E  E  (17) 

JV’=0  5=0 -^^=0 

We  tested  the  predictions  of  Equation  17  versus  the  mean  termination  time  of  our  DSN  simulation 
over  10  repetitions  in  each  of  10  randomly  chosen  environments.  The  durations  of  all  tasks  were  set 
at  1  time  unit,  as  were  the  duration  of  information  gathering  and  communication  actions,  with  the 
exception  of  the  4  environments  described  in  the  next  section.  Using  the  same  validation  statistic  as 
before  (Eq.  10)  in  each  case  we  were  unable  to  reject  the  null  hypothesis  that  the  actual  mean  termination 
equals  the  predicted  mean  termination  at  the  a  =  0.05  level.^  * 

4.5.1  Increasing  Task  Durations 

Figure  8  compares  the  termination  of  static  and  dynamic  restructuring  organizations  on  identical  episodes 
in  four  different  environments.  From  left  to  right,  the  environments  were  [A=:9,r  =  9,o=9,n  =  7], 
[A  =  4,  r  =  9,  o  =  3,  n  =  5],  [A  =  16,  r  =  8,  o  =  5,  n  =  4],  [A  =  9,  r  =  10,  o  =  6,  n  =  7]. 
Ten  different  episodes  were  generated  for  each  environment.  In  order  to  see  the  benefits  of  dynamic 
restructuring  more  clearly,  we  chose  task  durations  for  each  environment  similar  to  those  in  the  DVMT: 
do(VLM)  =  6,  do(VTM)  =:  2do(VCM)  =  2.®  Note  that  the  dynamic  organization  often  does 
significantly  better  than  the  static  organization,  and  rarely  does  much  worse — remember  that  is  many 
particular  episodes  that  the  dynamically  organized  agents  will  decide  to  keep  the  static  organization, 
although  they  pay  a  constant  overhead  when  they  keep  the  static  organization  (one  extra  communication 
action  and  one  extra  information  gathering  action,  given  that  the  time  for  a  message  to  reach  all  agents 
is  no  longer  than  the  communication  action  time). 

^The  same  caveats  we  discussed  earlier  still  apply. 

The  idea  being  that  the  VLM  methods  correspond  to  lowest  three  DVMT  KSIs  as  a  group,  and  the  other  methods 
correspond  to  single  DVMT  KSIs,  and  that  a  KSI  has  twice  the  duration  of  a  communication  action. 
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Figure  8:  Paired-response  comparison  of  the  termination  of  static  and  dynamic  systems  in  four  different  environ¬ 
ments  (ten  episodes  in  each).  Task  durations  are  set  to  simulate  the  DVMT  (see  text). 


5  Using  Meta-Level  Communication 

For  some  environments  D  =<  A^rj^r^o^T  >  one  of  the  three  organizational  choices  may  be  clearly 
better  in  the  long  run,  but  for  most  environments  the  choice  is  not  so  clear  given  the  variance  in 
system  performance.  The  choice  that  optimizes  performance  over  the  long  run  is  often  not  optimal  in 
any  particular  episode.  Taking  the  essential  equations  for  local  work  in  Section  4.3,  we  can  compute 
likelihood  intervals  on  the  predicted  performance  of  an  organization  under  each  of  the  three  coordination 
regimes  by  combining  the  local  likelihood  interval  on  the  expected  load  of  the  heaviest  loaded  agent, 
and  the  likelihood  interval  on  the  average  agent  load.  These  results,  for  the  50%  likelihood  interval,  are 
shown  in  Figures  9  and  10.  Again  we  have  assumed  that  all  execution,  communication,  and  information 
gathering  action  durations  have  the  same  value  (making  communication  relatively  expensive).  The  first 
figure,  Figure  9,  highlights  how  the  relationship  between  performance  under  a  static  organization  and 
a  dynamically  load  balanced  organization  changes  as  the  number  of  agents  increases.  As  expected,  load 
balancing  becomes  more  desirable  as  the  number  of  agents  increases  (in  relation  to  the  average  number 
of  tracks):  when  there  are  many  agents,  the  average  agent  load  becomes  very  low,  which  offsets  the  cost 
of  transferring  tasks.  In  this  figure  the  performance  difference  between  static  and  overlap  reorganization 
remains  nearly  constant  relative  to  the  number  of  agents. 


Number  of  Agents  [r=10,  o=3] 

Figure  9:  50%  likelihood  intervals  on  the  expected  termination  of  a  system  under  three  coordination  regimes, 
different  numbers  of  agents,  and  three  values  of  n 

The  second,  Figure  10,  points  out  how  dynamically  reorganizing  the  overlap  area  increases  the 
performance  over  static  organization  as  the  amount  of  overlap  increases.  For  this  graph  we  assumed  that 
the  agents  would  shrink  their  entire  area  of  responsibility  (as  opposed  to  minimizing  the  difference  in 
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maximum  versus  average  work  as  described  in  Section  4.4).  This  graph  shows  the  need  for  dynamically 
calculating  the  shrinkage  parameter  (p)  especially  at  high  levels  of  overlap  (note  how  the  dynamically 
reorganized  organization  is  predicted  to  do  worse  at  high  levels  of  overlap  in  the  n  =  20  portion).  The 
expected  performance  difference  between  the  static  organization  and  load  balancing  remains  relatively 
constant  across  changing  values  of  o.  In  both  figures  we  have  let  do(C)  =  do(Cshort)  =  do  (Cion  ): 
increasing  the  differences  in  these  values  will  move  the  corresponding  curves  directly  up  or  down. 


Figure  1 0:  50%  likelihood  intervals  on  the  expected  termination  of  a  system  under  three  control  regimes,  different 
overlaps,  and  three  values  of  n.  Likelihood  intervals  on  the  load  balanced  line  are  omitted  for  clarity. 

The  next  series  of  figures  demonstrate  the  effect  of  the  ratio  of  computation  duration  to  communica¬ 
tion  duration.  This  and  subsequent  figures  assume  that  the  dynamic  restructuring  shrinkage  parameter 
p  IS  set  to  minimize  the  difference  between  maximum  and  average  local  work  as  described  in  Section  4  4 
Figure  11  shows  how  the  expected  value  and  90%  likelihood  interval  on  system  termination  changes 
as  the  duration  of  a  method  execution  action  changes  from  equal  to  (lx)  a  communication  action  to 
10  times  (lOx)  that  of  a  communication  action.  The  task  structure  remains  than  of  the  DSN  example 

described  in  Section  2.5.  In  Figure  11  we  see  a  clear  separation  emerge  between  static  and  dynamic 
termination. 


Figure  11:  Effect  of  decreasing  communication  costs  on  expected  termination  under  a  static  organization  and 
dynamic  restructuring  (expected  value  and  90%  likelihood  interval,  =  25,  t-  =  9,  o  =  9,  n  =  7). 

•  .^Sures  assume  that  the  number  of  task  groups  n  is  known  beforehand.  The  reason  for  this 

IS  to  highlight  the  variance  implicit  in  the  organization,  and  minimize  the  influence  of  the  external 
environment.  Figure  12  shows  how  much  extra  variance  is  added  when  only  the  expected  value  of  n, 
which  IS  7?,  IS  known.  We  assume  that  the  number  of  task  groups  n  (in  the  DSN  example,  vehicle 
tracks)  that  occur  during  a  particular  episode  has  a  Poisson  distribution  with  an  expected  value  oft/. 


Overlap  [A=9,  r=10] 
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The  discrete  probability  function  for  the  Poisson  distribution,  given  in  any  statistics  book,  is  then: 

Priiy)  =  [Pr[n  =  y]] 

We  can  use  this  probability  in  conjunction  with  Eqns.  8,  11,  and  15  to  calculate  the  expected  value, 
50%,  and  95%  likelihood  intervals  on  termination  in  the  static  or  dynamic  organizations.  Note  in 
Figure  12  both  the  large  increase  in  variance  when  n  is  random  (between  the  top  bar  and  the  third 
bar  in  the  figure),  and  more  importantly  the  small  decrease  in  variance  in  the  dynamic  restructuring 
organization  (between  the  first  and  the  second  bars).  Note  also  that  the  mean  termination  time  for  the 
dynamic  organization  is  slightly  less  than  that  for  the  static  organization. 

9S%  Likelihoqd  Interval 


50%  Likelihqod Interval 


Figure  12:  Demonstration  of  both  the  large  increase  in  performance  variance  when  the  number  of  task 
groups  n  is  a  random  variable,  and  the  small  decrease  in  variance  with  dynamic  restructuring  coordination 
[74.  =  9,r  =  9,o  =  9].  Where  n  is  known,  n  =  7.  Where  n  is  a  random  variable,  the  expected  value  rj  =  7. 

These  figures  bring  us  to  the  final  point  of  this  paper:  often  system  performance  can  be  improved 
significantly  by  dynamic  reorganization,  but  it  will  rarely  always  be  improved.  Therefore,  meta-level 
communication  between  agents  about  their  local  loads  can,  with  a  small  communication  cost,  pinpoint 
the  true  costs  and  benefits  of  the  various  organizational  structures,  allowing  an  informed  organizational 
decision  to  be  made.  Instead  of  an  agent  making  a  decision  about  restructuring  or  load  balancing  by 
assuming  the  average  load,  the  agent  will  have  the  actuallodA  for  the  neighboring  agents.  As  we  said  in 
the  introduction,  the  proper  organization  is  often  one  that  exploits  information  that  resolves  uncertainties 
about  the  current  environment  as  it  becomes  available  to  the  agents,  allowing  the  agents  to  then  create 
the  most  efficient  organization  for  the  situation. 

6  Conclusions 

The  results  of  this  paper  can  be  looked  at  from  three  points  of  view.  From  the  practitioner  s  viewpoint, 
the  analysis  presented  here  resulted  in  a  set  of  design  equations  that  can  be  used  directly  to  optimize  the 
performance  of  a  simple  DSN,  or  explore  the  design  space  given  some  model  of  how  expensive  agents 
are  and  what  bounds  (mean,  median,  90%  quantile)  on  their  performance  are  required.  Several  of  the 
simplifying  assumptions  we  used,  such  as  constant  communication  and  information  gathering  costs, 
can  be  easily  replaced  with  submodels  chosen  by  the  designer.  From  the  viewpoint  of  the  distributed 
AI  community,  we  have  returned  to  look  at  the  some  of  the  problems  first  studied  by  Durfee,  Lesser, 
and  Corkill[7].  They  concluded  that  “Our  intent  is  to  show  that  overly  specialized  organizational 
structures  allow  effective  network  performance  in  particular  problem-solving  situations,  but  that  no 
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such  organization  is  appropriate  in  all  situations.  In  this  paper  we  reach  the  same  abstract  conclusion, 
but  also  show  precisely  what  the  effect  is  of  a  particular  organizational  structure  (characterized  by  both 
its  structural  components  and  its  coordination  algorithm)  in  an  environment  (characterized  by  the 
structure  and  frequency  of  its  tasks)  in  a  clear  way  that  not  only  allows  us  to  predict  performance  but 
to  explain  it.  The  technique  of  using  binomial  approximations  should  also  prove  useful  in  different 
domains.  From  the  viewpoint  of  the  general  research  community  this  paper  presents  a  methodology  for 
answering  questions  about  the  design  of  a  system  by  analysis  and  simulation.  In  such  a  methodology, 
the  observation  of  particular  phenomena  in  a  complex  system  (the  DVMT)  leads  to  the  building  and 
verification  of  general  models  that  predict  and  explain  such  phenomena. 

In  the  short  term,  this  work  leads  to  the  explanation  of  other  interesting  distributed  problem  solving 
phenomena  displayed  in  [7].  The  addition  of  noise  at  DSN  sensors  leads  to  the  necessity  of  more  complex 
coordination  with  the  introduction  of  more  complex  task  interrelationships  (such  facilitation[A]) .  The 
addition  of  correlated  noise  in  the  environment  can  then  cause  these  new,  more  complex  coordination 
mechanisms  to  break  down,  producing  the  phenomenon  recognized  as  distraction.  In  the  long  term, 
we  are  working  towards  a  complete  characterization  of  generalized  partial  global  planning[3]  as  a  first 
step  towards  a  theory  of  coordination  in  distributed  problem  solving,  ^^e  and  other  researchers  are  also 
using  our  task  structure  characterization  for  the  analysis  and  simulation  of  problems  in  real-time  and 
parallel  scheduling. 
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A  Distribution  of  the  Binomial  Max  Order  Statistic 

To  expand  on  the  formulae  in  Section  2.2,  the  amount  of  a  single  task  group  seen  by  an  agent  can  be 
viewed  as  a  random  variable  S  with  probability  density  function  f{s)  and  corresponding  cumulative 
distribution  function  F{s).  Let  a  be  the  number  of  agents  that  initially  see  part  of  a  single  task 
group.  By  elementary  statistics,  the  density  of  the  max  order  statistic  Smax  =  max  (5i,  52, . . . ,  So)  is 
^^(5)  =  aF(s)“'"^/(s).  The  expected  heaviest  load  then  is  sga{s)ds.  In  the  DSN  environment, 
5  is  discrete,  and  its  probability  function  (determined  empirically)  is  heavily  weighted  toward  r  (the 
maximum).  To  simplify  the  analysis,  instead  of  letting  5  correspond  to  the  number  of  subtasks  in  a 
single  task  group  seen  by  an  agent,  we  have  a  new  random  variable  S  equal  1  if  the  agent  sees  the 
maximum  amount,  and  0  otherwise.  Now  S  has  a  Bernoulli  (coin-tossing)  distribution  with  parameter 
p  corresponding  to  the  chance  of  an  agent  seeing  the  maximum  amount  r  of  a  task  group.  S  then 
corresponds  to  the  number  of  times  an  agent  sees  the  maximum  r  if  the  agent  sees  N  task  groups 
(tracks).  S  has  a  binomial  distribution  with  parameters  p  and  N,  Now  if  a  agents  see  N  task  groups 

each,  what  is  the  distribution  of  S  ~  max  (Si,  S2, . . . ,  Sa)?  This  is  the  max  order  statistic  for  the 
binomial  distribution,  and  because  it  is  discrete,  can  be  easily  derived.  The  probability  function  and 
cumulative  distribution  function  for  the  binomial  distribution  are: 

^nA^)  =  [Pi-[S  =  s]] 

■SAr,p(s)  =  E®=o  ^iv,p(®)  [Pr  [S  <  sj] 

If  we  assume  each  track  is  independent  of  the  others,  we  can  derive  the  cumulative  distribution  function: 

Pr[S<s]  =  Pr[max(Si,S2,...,S„)  <  s] 

=  Pr[Si  <  s]Pr[S2  <  s]---Pr[Sa  <  s] 


The  probability  function  5a,JV,p(s)  =  Pr  [S  =  s]  is  then: 


3a, nA^)  —  ~  ■SjV,p(s  —  1)“ 
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Abstract 

The  functionally-accurate,  cooperative  (FA/C)  distributed  problem-solving  paradigm  is  one  approach  for 
organizing  distributed  problem  solving  among  homogeneous,  cooperating  agents.  The  idea  behind  the  FA/C 
model  is  that  agents  should  produce  tentative,  partial  results  based  on  only  local  information  and  then 
exchange  these  results,  exploiting  the  constraints  that  exist,  among  their  local  subproblems  to  resolve  the 
uncertainties  and  global  inconsistencies  that  result  from  the  use  of  incomplete  information.  While  several 
FA/C  systems  have  been  implemented,  there  has  been  little  formal  analysis  of  the  quality  of  the  solutions 
that  can  be  produced  using  the  approach  or  of  the  conditions  that  are  necessary  for  the  approach  to  be 
effective.  This  paper  reports  on  work  we  have  done  to  formally  analyze  the  FA/C  model  in  the  context  of 
distributed  sensor  interpretation  (SI).  Several  results  are  presented  that  compare  the  quality  of  solutions 
produced  by  a  distributed  FA/C  system  to  those  produced  by  an  equivalent  centralized  system,  based  on 
particular  agent  problem-solving  and  coordination  strategies.  We  first  establish  that  while  it  is  possible  for 
an  FA/C  system  to  produce  the  same  solution  as  a  centralized  system,  this  requires  the  use  of  interpretation 
and  coordination  strategies  that  are  impractical  for  most  SI  applications.  Because  of  this  we  then  consider 
the  effect  of  “approximate”  interpretation  and  coordination  strategies,  given  some  assumptions  about  the 
characteristics  of  the  domain. 


*  This  work  was  supported  in  part  by  the  Department  of  the  Navy,  Office  of  the  Chief  of  Naval  Research,  under  contract 
N00014-92-J-1450.  The  content  of  the  information  does  not  necessarily  reflect  the  position  or  the  policy  of  the  Government, 
and  no  official  endorsement  should  be  inferred. 


1  Introduction 


In  the  functionally  accurate,  cooperative  (FA/C)  paradigm  for  distributed  problem  solving  [8,  10], 
agents  need  not  have  all  the  information  necessary  to  completely  and  accurately  solve  their  sub- 
problems.  Instead,  agents  are  designed  to  produce  tentative,  partial  results  based  on  only  local 
information  and  to  then  exchange  these  results  with  the  other  agents  to  resolve  local  uncertainties 
and  global  inconsistencies.  The  basic  intuition  behind  this  approach  is  that  for  many  applications 
there  exist  (inter-agent)  constraints  among  the  subproblems,  and  these  constraints  can  be  exploited 
to  resolve  the  inconsistencies  and  uncertainties  that  occur  in  local  problem  solving  due  to  the  lack 
of  accurate,  complete,  and  up-to-date  information. 

While  several  systems  that  use  the  FA/C  approach  have  been  built  (e.g.,  [3,  9]),  there  has  never 
been  any  formal  analysis  of  the  quality  of  the  solutions  that  can  be  produced  by  the  approach  or 
of  the  effect  of  domain  conditions  and  coordination  strategies  on  solutions.  This  paper  reports  on 
our  efforts  in  formally  characterizing  the  FA/C  paradigm.  We  will  present  some  results  about  the 
quality  of  solutions  that  can  be  produced  by  an  FA/C-based  system  relative  to  particular  agent 
problem  solving  and  coordination  strategies.  Since  most  FA/C  applications  have  been  in  distributed 
sensor  interpretation  (SI)  (particularly  distributed  vehicle  monitoring)  our  analysis  focuses  on  this 
domain.^  Though  we  concentrate  on  this  domain,  we  believe  that  our  basic  results  will  be  applicable 
to  other  domains  in  which  local  problem  solving  involves  approximate,  satisficing  search. 

First,  it  is  important  to  be  clear  about  what  we  will  and  will  not  consider.  [10]  referred  to  the 
issues  of  solution  uncertainty  and  control  uncertainty  m  FA/C  problem  solving,  and  these  concepts 
are  useful  in  delineating  the  focus  of  our  efforts.  In  this  paper  we  concentrate  on  the  issue  of  solution 
uncertainty:  uncertainty  about  the  correctness  of  partial  solutions  due  to  agents  having  incomplete 
local  information.  Control  uncertainty  is  largely  concerned  with  what  actions  agents  should  take 
and  when  they  should  take  them  to  achieve  efficient,  coherent  global  problem  solving.  Coordination 
of  agent  activities  involves  both  solution  and  control  uncertainty,  but  we  will  ignore  the  issue  of  when 
information  should  be  communicated  and  the  possible  communication  of  meta-level  information. 
Thus,  while  we  examine  the  conditions  that  are  necessary  iov  effective  FA/C  problem  solving,  these 
conditions  are  not  sufficient  to  guarantee  it.  Coordination  strategies  for  efficient  FA/C  problem 
solving  remain  an  important  area  of  research.^  Another  important  issue  in  the  design  of  FA/C 
systems  is  the  role  that  agent  architectures  play  in  supporting  effective  FA/C  problem  solving.  For 
example,  some  architectures  allow  a  wide  range  of  inconsistencies  to  be  resolved  using  very  directed, 
limited  communication  of  data  and  hypotheses  among  the  agents  [3,  4].  Finally,  there  are  other 

Distributed  sensor  interpretation  plays  an  important  role  in  many  military  situation  assessment  (decision  support) 
systems. 

We  are  pursuing  both  empirical  and  analytic  approaches  to  address  this  issue  [6,  7]. 
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factor,  that  might  prevent  effective  FA/C  problem  solving  in  certain  domains,  but  they  are  beyond 
the  scope  of  this  paper.^ 

The  m,,in  performance  that  we  address  is  the  quality  of  solutions  produced  by 

an  A/C  system.  Several  theorems  are  presented  that  compare  the  solutions  of  an  FA/C  system  to 
e  solufons  that  wouid  be  produced  by  an  “equivalent”  centralised  system,  given  particuiar  agent 
problem-solving  and  coordination  strategies.  A  key  issue  in  this  ewduation  is  the  fact  that  approx¬ 
imate,  satsdicmg  techniques  are  often  required  to  identify  SI  solutions  (see  Section  3).  Because  of 
ns  here  are  „veral  standards  against  which  we  might  judge  solution  quality.  We  have  chosen 
o  e  ne  the  globaUy  best  solution”  as  the  solution  that  would  be  produced  by  an  equiuu/ent 
centr^ae  system:  a  centralised  system  using  the  same  interpretation  strategy  as  the  individual 
istnbuted  agents  use  for  processing  their  own  local  data.  The  advantage  of  this  dehnition  is 

a  It  partially  separates  the  effects  of  approximate  local  strategies  from  the  effect  of  distributed 

processing. 

We  also  consider  the  effect  of  different  coordination  strategies,  in  terms  of  the  completeness 
with  which  subproblem  (local  interpretation)  interactions  are  pursued.  The  analysis  assumes  the 
capabiht.es  of  an  abstract  model  of  the  DRESUN  system  for  distributed  SI  (3,  4].  In  particular, 
assume  t  at  all  possible  subproblem  interactions  can  be  easily  identified  and  that  they  can 
e  selective  y  (and  incrementaUy)  pursued  (see  Section  4).  Again,  there  are  several  alternative 
s  ^  eg.es  that  might  be  used.  These  range  Rom  “complete  strategies”  that  effectively  ehminate 
effects  of  the  distribution  of  data,  to  “incomplete  strategies’  that  do  not. 

Our  results  begin  by  showing  that  it  is  possible  for  a  DRESUN-based  distributed  SI  system  to 
pro  uce  exactly  the  same  global  interpretation  as  would  be  produced  by  a  centralised  system  and 
a  IS  can  in  eed  be  the  optimal  solution.  However,  this  is  true  only  when  the  centralised  system 
and  the  agents  are  doing  complete,  exact  interpretation  and  subproblem  interactions  are  completely 

A  ke'"  '  T  “^mptions  for  SI  in  genera]  and  for  distributed  SI 

d  for  excessive  communication  among  the  agents.  This  is  the  main  reason  for  using  incomplete 
c  ordination  strategies:  to  make  PA/C  problem  solving  efficient  enough  to  be  effective.  The  res, 
of  our  results  focus  on  the  effects  of  incomplete,  approximate  coordination  strategies 

n  exploring  the  use  of  incomplete  coordination  strategies,  the  most  important  issue  we  will 
xp  ore  IS  w  et  er  FA/C  systems  can  focus  on  the  subproblem  interactions  involving  only  the  local 

r2:::::r:: 

unaey,  his  strategy  does  not  m  general  guarantee  that  the  global  solution  would  be  be 

than  l"  d7'„fro“y  '’"'‘“P  '’“"'“i »l«lio"s 
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the  optimal  solution  or  even  the  same  as  what  would  be  produced  by  a  centralized  system.  In 
conjunction  with  this  result,  we  discuss  certain  domain  characteristics  that  would  allow  an  FA/C 
distributed  SI  system  to  produce  solutions  that  are  “satisfactory.”  The  key  concept  that  we  consider 
is  what  we  term  approximate  monotonicity. 

The  final  set  of  results  focus  on  the  use  of  approximate  interpretaton  and  coordination  strategies 
where  solutions  are  selected  using  a  satisficing  approach  in  which  hypotheses  are  judged  part  of 
the  solution  if  their  belief  surpasses  an  “acceptance  threshold.”  This  is  a  common  strategy  for 
many  SI  systems.  Here  the  coordination  strategy  is  incremental  in  that  it  uses  the  same  criteria 
to  determine  the  degree  to  do  pursue  additional  inter- agent  suproblem  interactions  or  not.  We 
show  that  there  are  conditions  under  which  it  is  possible  to  guarantee  that  the  distributed  system 
produces  a  solution  that  is  comparable  to  the  centralized  solution,  and  other  conditions  under  which 
there  is  merely  some  probability  of  obtaining  such  a  solution. 

In  the  next  two  sections  we  describe  our  model  of  (distributed)  sensor  interpretation,  discuss  why 
SI  is  different  from  much  studied  “diagnosis  problems,”  and  then  examine  the  use  of  approximate 
strategies  for  determining  SI  solutions.  Section  4  introduces  the  DRESUN  distributed  SI  model 
that  we  assume  for  our  analysis,  followed  in  Section  5  by  a  description  of  the  coordination  strategies 
that  wiU  be  considered.  Section  6  then  defines  the  notation  that  will  be  used  in  the  remainder  of  the 
paper.  Section  7  contains  the  theorems  and  proofs  related  to  FA/C  solution  quality  for  distributed 
SI.  The  paper  concludes  with  a  summary  of  the  major  results  in  the  paper  and  our  future  research 
plans. 

2  Distributed  Sensor  Interpretation 

By  sensor  interpretation,  we  mean  the  determination  of  high-level,  conceptual  explanations  of  sensor 
data.  Our  model  of  the  interpretation  process  will  be  essentially  that  used  in  [2]:  interpretation 
hypotheses  are  incrementally  constructed  via  abductive  inferences,,  based  on  a  causal  domain  model 
that  defines  the  relationships  among  the  data  types  and  abstraction  types.  For  each  type  T,  the 
causal  model  defines  the  type’s  support,  St^  and  its  possible  explanations,  Ep.  Sp  —  {S^-  :  is 

a  type  instance  specification,,  which  defines  a  legal  support  for  a  hypothesis  of  type  T}  (a  type 
instance  specification  has  an  associated  interpretation  type  and  a  set  of  constraints  that  define  the 
legal  attribute  values  for  hypotheses  of  that  type  to  support  a  hypothesis  of  type  T).  £t  =  {Et  :  E,- 
is  an  interpretation  type  that  might  explain  some  type  T  hypothesis}.  If  T  is  a  data  type  then  St 
is  empty.  If  T  is  a  top-level  type  then  Et  is  empty. 

An  interpretation  system  makes  abductive  inferences  that  identify  possible  explanations  for  a 
piece  of  data  or  a  hypothesis.  Thus,  every  hypothesis  h  with  type{h)  =  T  is  the  result  of  a  set  of 
abductive  inferences  each  of  the  form  hi  =>  /i,  where  type(hi)  G  St^  Conversely,  each  hypothesis 
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h  with  type(h)  =  T  may  be  part  of  a  set  of  abductive  inferences  of  the  form  h  =>  hj,  where 
type{hj)  e  Et-  We  will  use  the  following  notation:  support-oj[h)  =  the  (immediate)  support 
for  h;  suppori$[h^hi)\  and  explains{h^hj).  In  addition  to  its  immediate  support,  each  hypothesis 
must  ultimately  be  supported  by  sensor  data  via  a  chain  of  abductive  inferences  c! 
hm,  where  d  is  data  and  supports{hi,d)^  supports{hi,hi^i).  To  deal  with  these  inference  chains, 
we  will  use  the  notation  support- of *{h)  to  refer  to  that  data  that  ultimately  supports  h,  as  well  as 
the  analogous  d),  and  explains*{d^hm)- 

Abductive  inferences  are  uncertain  rather  than  logically  correct  inferences  that  provide  evidence 
for  the  hypotheses  rather  than  conclusively  proving  them.  The  key  source  of  uncertainty  for  any 
hypothesis  is  the  possibility  of  alternative  explanations  for  the  data  that  supports  the  hypothesis. 
Even  if  complete  support  can  be  found  for  a  hypothesis  (i.e.,  there  is  a  hypothesis  or  datum  in 
support- ofi^h)  that  corresponds  to  each  G  iSj)?  l^he  hypothesis  may  still  be  uncertain  as  a  result 
of  competing,  alternative  explanations  for  support-of*{h).  Furthermore,  because  hypotheses  are 
incrementally  constructed,  they  may  not  have  complete  support  at  a  given  point  in  the  processing 
even  if  complete  support  could  be  found  in  the  available  data  (complete  support  might  not  be  able 
to  be  found  even  if  the  hypothesis  is  correct  due  to  masking,  environmental  disturbances,  or  sensor 
errors). 

An  interpretation  of  a  data  set  is  an  explanation  what  caused  all  of  the  data.  In  general,  an 
interpretation  will  be  a  composite  of  a  set  of  hypotheses  whose  types  are  from  a  specified  subset  of 
the  interpretation  types  (the  explanation  corpus  [12]),  each  of  which  explains  some  subset  of  the 
data,  and  which  together  explain  all  of  the  data  set.  Formally,  given  data  set  D  and  interpretation 
/,  if  hi  G  I  then  support- of \hi)  G  D  and  D  =  the  union  of  support- of *{hi)  forall  hiinl.  A  solution 
to  an  interpretation  problem  is  an  interpretation  of  the  data  that  is  judged  best  according  to  some 
criteria  (we  will  return  to  this  issue  below). 

In  a  centralized  SI  system,  aU  of  the  sensor  data  is  available  to  the  single  agent.  In  a  distributed 
system,  each  agent  has  (direct)  access  to  data  from  only  a  subset  of  the  sensors,  and  each  sensor  is 
associated  with  a  single  agent.  As  a  result,  each  agent  monitors  only  a  portion  of  the  overall  area 
of  interest,  and  agents’  local  solutions  must  be  combined  in  order  to  construct  a  global  solution. 
Construction  of  a  global  solution  may  not  be  straightforward,  however,  because  the  local  solutions 
are  often  not  independent  and  may  in  fact  be  inconsistent  because  they  are  based  on  different 
incomplete  subsets  of  the  data.  Agent  solutions  are  interdependent  whenever  data  (evidence)  for 
hypotheses  are  spread  among  multiple  agents  or  when  agent  areas  of  interest  overlap  as  a  result  of 
overlapping  sensor  coverage. 

In  this  paper  we  are  interested  in  analyzing  the  quality  of  the  global  solutions  that  can  be 
produced  by  FA/C  distributed  interpretation  systems.  To  do  this  we  need  some  standard  for 
comparison — i.e.,  the  “globally  best  solution.”  Usually  one  would  like  this  to  be  the  most  probable 
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explanation  (MPE)  [12]  given  all  of  the  globally  available  data.^  The  problem  with  defining  the 
globally  best  solution  to  be  the  global  MPE  is  that  for  most  real-world  SI  problems  it  is  impractical 
to  compute  the  (true)  MPE.  For  this  reason  and  because  our  main  interest  is  the  effect  of  FA/C 
distributed  problem  solving,  we  have  chosen  to  compare  distributed  SI  solutions  to  those  of  an 
equivalent  centralized,  system.  By  this  we  mean  the  solution  that  would  be  produced  by  a  centralized 
(single-agent)  system  that  has  access  to  aU  of  the  data  of  the  distributed  system  and  that  uses  the 
same  (possibly  approximate)  interpretation  strategies  as  are  used  by  the  distributed  agents. 

The  complexity  of  determining  the  true  MPE  can  be  understood  by  considering  the  differences 
between  SI  problems  and  the  hinds  of  problems  that  are  typically  studied  in  research  on  abductive 
inference  and  probabilistic  network  inference  (e.g.,  [12,  13]).  For  simplicity,  we  wiU  refer  to  these 
problems  as  diagnosis  problems.  One  key  difference  is  that  diagnosis  problems  are  propositional 
while  SI  IS  not,  in  general.  In  other  words,  diagnosis  problems  have  a  fixed  set  of  causes  and  possible 
findings,  with  fixed  relations  among  them.  The  problem  of  identifying  the  MPE  in  such  svstems 
has  been  studied  extensively  (e.g.,  [12,  13]).  By  contrast,  while  SI  problems  have  a  fixed  set  of 
“top-level”  cause  types  and  a  fixed  set  of  data  types,  they  can  have  an  indeterminate  number  of 
instances  of  any  of  the  types.  For  example,  in  a  vehicle  monitoring  system,  an  unknown  number  of 
vehicles  wiU  have  been  responsible  for  the  overall  data  set  and  each  vehicle  will  produce  a  ‘Hrack” 
of  data  points  that  can  be  of  varying,  unknown  lengths.® 


Because  there  can  be  an  indeterminate  number  of  instances  of  any  top-level  cause,  a  key  prob¬ 
lem  for  SI  is  what  is  known  in  the  target  tracking  literature  as  the  data  association  problem  [1]: 
which  target  should  data  be  associated  with?  The  data  association  problem  and  the  possibility 
of  an  indeterminate  number  of  top-level  causes  lead  to  the  problem  of  correlation  ambiguity  (it  is 
ambiguous/uncertain  which  potential  explanation  hypothesis  a  data  evidence  should  be  associated 
with)  and  to  a  combinatorial  explosion  of  possible  explanations  for  a  data  set  (potentially,  every 
single  piece  of  data  could  have  a  unique  source  instance  as  its  cause).® 

Other  problems  faced  by  SI  systems,  but  not  by  most  diagnosis  problems  relate  to  the  na- 
ture  of  sensor  data:  evidence  from  sensor  data  is  not  conditionally  independent  even  given  a 
^onrce/explanation  hypothesis  and  there  can  be  massive  amounts  of  data  resulting  from  multiple 

i.i..impir  .h. 

MPE  c.„  b.  Incorrect  wh™  ,1"  ."Ifa ‘mho  L  by  difeeol  even,..  The 

Also,  the  MPE  may  not  be  the  “best”  solution  one.  when  the  system  has  a  poor  domain  model, 

instance,  failing  to  warn  of  a  possible  attlv  °ne  considers  what  wiU  be  done  with  the  interpretation.  For 

tailing  to  identffy  .  (n.ili.y)  than 

co„‘.°Ln:.t  itrpt'n'stwlr.  rtf' 

very  uect.i:  |„je- J.  n.hf  .nr.liSrc.  f  IT"  >l“y> 

•In  general  it  i,  olten  impcLible  t“  aZLTT  T'  r  l'""d'ed»  of  .inrnl.aneou.  vehicles, 

where  target,  m.,  be  poorly  sensed  (i.e.,  often  mild  byrbeTents)  “Pmaily  »  applications 
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passive  sensors  continuously  operating  in  a  noisy  environment  J  One  way  that  we  might  sum  up 
the  differences  is  to  say  that  in  diagnosis  problems  the  key  source  of  complexity  is  the  need  to 
consider  all  combinations  of  causes  for  the  evidence,  while  in  SI  problems  it  is  the  explosion  of 
possible  interpretations  as  large  amounts  of  data  are  considered.  Multiple  cause  combinations  are 
often  less  of  a  problem  for  SI  because  data  can  often  be  assumed  to  have  single  causes,  so  most 
source  hypotheses  are  alternatives.  ■ 

Despite  the  differences  between  SI  and  diagnosis  problems,  it  is  useful  to  think  of  the  networks  of 
data  and  hypotheses  that  SI  systems  construct  as  being  very  similar  to  probabilistic  belief  networks. 
Key  differences  to  keep  in  mind  are:  the  SI  hypothesis  network  wiU  be  incrementally  constructed  as 
data  is  interpreted,  the  complete  network  of  possible  interpretations  may  not  be  constructed,  belief 
computations  are  often  approximate,  a  noisy- OR  model  is  usually  not  appropriate  for  modeling 
source/explanation  interactions,  and  while  networks  may  be  drawn  in  such  a  way  that  data  appears 
to  be  conditionally  independent  this  is  often  not  the  case.  Also,  instead  of  random  variables,  in  SI 
we  talk  about  data  and  hypotheses,  which  can  be  complex,  multi-attribute  entities. 

3  Approximate  Interpretation  Strategies 

To  deal  with  the  characteristics  discussed  in  the  previous  section,  SI  systems  must  be  construc¬ 
tive  [2,  5]  and  they  must  use  approximate,  satisficing  strategies  to  determine  solutions.  It  is  not 
the  point  of  this  paper  to  investigate  approximate  strategies  for  SI,  however  to  understand  the 
significance  of  some  of  our  results  one  must  understand  that  that  most  real-world  SI  systems  are 
approximate — even  without  the  issues  of  distributed  SI.  We  address  this  in  our  results  by  consider¬ 
ing  the  effect  of  approximate  interpretation  strategies  as  weU  as  the  effect  of  different  coordination 
strategies.  Interpretation  strategies  are  specified  in  general  terms,  since  the  particular  strategies 
employed  wiU  be  largely  domain-specific.  We  describe  alternative  strategies  via  assumptions  about 
the  characteristics  of  the  interpretations  that  result  from  the  strategies  that  agents  use  for  their 
local  data.  In  the  remainder  of  this  section,  we  will  give  some  sense  of  the  kinds  of  approximation 
techniques  that  SI  systems  can  use  and  their  general  effects.  The  distributed  SI  model  we  describe 
in  the  next  section  is  based  on  the  RESUN/DRESUN  architecture,  which  provides  great  flexibility 
for  implementing  approximate  interpretation  strategies  [2,  3]. 

There  are  five  basic  approximation  techniques  that  can  be  used  by  SI  systems:  process  only 
part  of  the  available  data,  construct  only  some  of  the  possible  interpretation  hypotheses  for  the 
processed  data,  compute  approximate  belief  ratings  (conditional  probabilities)  for  the  hypotheses,® 
consider  only  some  of  the  possible  composite  interpretations  (hypothesis  combinations),  and  use 

^The  probabilistic  belief  network  community  is  just  starting  to  become  aware  of  the  data  association  problem  and 
the  issues  it  raises  that  have  not  been  addressed  in  the  work  on  diagnosis  problems — e.g.,  [11]. 

®We  will  use  the  term  “belief”  to  mean  the  degree-ohbelief  accorded  to  a  hypothesis  or  interpretation.  This  will 
be  either  the  conditional  probability  or  approximate  conditional  probability  of  the  object. 
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complete  propagation 


incomplete  propagation 


Figure  1:  An  example  of  approximate  interpretation  due  to  incomplete  evidence  propagation. 

In  the  complete  propagation  case,  the  system  not  only  has  created  the  most  probable  explanation,  /ii,  it  also  has 
created  the  alternative  explanations,  /12  and  hs  (using  the  most  complete  support  possible).  This  allows  the  system 
to  determine  the  conditional  probability  of  hi  given  the  available  data  In  the  incomplete  propagation 

case,  the  alternative  explanations  for  hi  have  not  been  created.  This  means  that  the  belief  computed  for  hi  is  only 
an  approximation  of  the  true  conditional  probability  since  the  likelihood  of  the  alternative  explanations  has  been 
only  approximately  considered,  hi  is  still  uncertain,  though,  because  the  possibility  of  alternative  type  2  and  3 
explanations  for  each  piece  of  supporting  data  is  known,  as  are  the  a  priori  likelihoods  of  these  explanations.  (Note 
that  for  simplicitly,  the  figure  does  not  show  the  numerous  incompletely  supported  versions  of  the  hypotheses  that 
also  might  have  been  created  in  the  case  of  complete  propagation.) 

some  method  other  than  maximum  joint  probability  to  select  the  solution  interpretation.  For 
example,  the  RESUN  SI  framework  allows  the  use  of  heuristic  knowledge  to  control  the  data  that 
is  processed  and  the  hypotheses  that  are  constructed,  and  it  allows  solutions  to  be  assembled  from 
hypotheses  whose  belief  ratings  simply  surpass  some  acceptance  threshold. 

Obviously,  these  techniques  are  not  independent.  If  a  system  does  not  process  all  of  the  data 
then  it  cannot  in  general  create  all  possible  interpretations  of  the  complete  data  set.  If  a  system 
does  not  create  every  possible  interpretation  of  its  data  (i.e.,  every  possible  hypothesis  h  such  that 
explains*{D^h))^  this  not  only  limits  the  interpretations  that  can  be  considered,  it  also  results 
in  hypothesis  belief  ratings  being  only  approximations  of  the  true  conditional  probabilities  of  the 
hypotheses.  This  is  because  incomplete  hypothesis  construction  results  in  incomplete  propagation 
of  the  effects  of  evidence  (we  are  using  “evidence  propagation”  in  the  same  basic  sense  that  [12] 
refers  to  “belief  propagation”).  Figure  1  provides  an  example  of  this  situation. 

The  bottom  line  is  that  these  approaches  will  result  in  SI  solutions  that  are  only  approximations 
of  the  MPE:  they  may  be  incomplete  or  they  may  not  be  the  most  probable  composite  interpretation. 
In  the  following  sections  we  will  consider  both  exact  and  approximate  interpretation  strategies 
being  applied  by  agents  to  process  their  locally  available  data.  We  will  use  the  name  exact  local 
interpretation  strategy  to  refer  to  a  strategy  in  which  an  agent’s  local  solution  (ignoring  global 
interactions)  would  be  the  true  MPE  of  the  complete  local  data  set.  The  term  approximate  local 
interpretation  strategy  will  be  used  to  refer  to  any  strategy  that  is  not  exact.  We  will  be  more 
precise  about  the  type  and  effect  of  approximate  local  strategies  only  as  necessary. 
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One  issue  that  we  have  not  yet  discussed,  but  which  is  of  great  importance  with  many  approx¬ 
imate  interpretation  and  coordination  strategies  is  the  termination  problem:  has  the  system  done 
enough  work  to  get  a  satisfactory  solution?  For  Si  this  means  has  the  system  processed  enough 
data,  performed  sufficient  evidence  propagation,  and  evaluated  enough  interpretations  to  have  a 
reasonably  good  solution?  We  will  return  to  the  issue  of  termination  criteria  in  our  discussion  of 
coordination  strategies  and  in  the  results. 

4  The  DRESUN  Distributed  SI  Model 

In  an  FA/C  system,  there  must  be  some  mechanism  to  drive  interactions  among  the  agents  so 
that  incorrect  and  inconsistent  local  solutions  can  be  detected  and  dealt  with.  Ideally,  this  would 
be  accomplished  with  a  mechanism  that  allowed  agents  to  understand  where  there  are  constraints 
among  their  subproblems,  so  that  information  interchange  could  be  highly  directed.  DRESUN  [2, 

3,  4]  provides  this  capability,  and  it  will  form  the  basis  for  our  model  of  the  capabilities  of  an  FA/C 
agent. 

DRESUN  agents  create  symbolic  source  of  uncertainty  statements  (SOUs)  to  represent  all  of  the 
reasons  why  their  hypotheses  are  uncertain  based  on  their  local  evidence.  Whenever  it  is  determined 
that  a  hypothesis  can  obtain  evidence  from  another  agent — i.e.,  whenever  a  subproblem  interaction 
(constraint)  is  detected— DRESUN  agents  create  global  consistency  SOUs  (GSOUs)  to  represent  the 
potential  interactions.  Each  GSOU  denotes  that  another  agent  might  be  able  to  provide  additional 
evidence  that  would  corroborate  or  contradict  the  associated  local  agent  hypothesis.®  GSOUs  are 
viewed  as  sources  of  uncertainty  about  the  correctness  of  an  agent’s  local  solution  because  until  a 
GSOU  is  “resolved,”  it  is  uncertain  whether  the  associated  hypothesis  is  globally  consistent.  In  this 
way,  the  basic  DRESUN  problem-solving  goal  of  resolving  interpretation  uncertainty  drives  agents 
to  communicate  and  produce  solutions  that  are  globally  consistent. 

To  see  that  it  is  possible  to  identify  all  possible  subproblem  interactions,  consider  that  in  SI  there 
are  just  three  classes  of  evidential  interactions  between  a  local  interpretation  hypothesis  and  an 
external  agent:  the  external  agent  might  have  duplicate  support  evidence  due  to  overlapping  sensor 
coverage,  it  might  have  support  evidence  for  the  hypothesis,  or  it  might  have  explanation  evidence. 
AU  instances  of  these  situations  can  be  easily  detected  given  a  local  hypothesis,  the  SI  domain  model, 
and  knowledge  of  the  organization  of  agent  interest  areas.  DRESUN  uses  three  GSOUs  to  denote 
instances  of  these  global  interactions:  consistent-overlapping-model,  consistent-global-extension,  and 

sometimes  denote  that  an  agent  (or  group  of  agents)  is  a  definite  sontce  of  evidence  and  sometimes 
only  that  an  agent  (or  agents)  is  a  possible  source  of  evidence.  Uncertainty  arises  from  things  like  only  partial  coverage 
by  an  agent  of  the  region  from  which  evidence  for  a  hypothesis  might  originate— e.g.,  the  possible  locations  for  the 
source  vehicle  for  a  ghost  track  might  include  area  not  completely  monitored  any  agent. 
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Consistent  Overlapping  Model 


Consistent  Global  Extension 


Consistent  Global  Evidence 


Figure  2:  Examples  of  global  consistency  SOUs  for  vehicle  monitoring. 


consistent- global- evidence.  Note  that  because  of  the  special  role  of  “continuous”  support  evidence  in 
SI  (a  key  type  of  support  evidence  that  is  not  conditionally  independent),  a  special  GSOU  is  used  to 
represent  these  support  interactions  while  “individual”  evidence  interactions  are  represented  with 
another  GSOU  (that  is  used  for  both  support  and  explanation  evidence).  Figure  2  shows  examples 
of  vehicle  monitoring  situations  involving  each  of  these  GSOUs. 

To  understand  the  effect  of  approximate  local  problem  solving  (interpretation)  strategies  on 
global  solution  quality,  it  is  important  to  be  clear  that  in  DRESUN  GSOUs  are  created  based 
only  on  the  interpretation  hypotheses  that  are  constructed.  In  other  words,  they  do  not  result 
from  direct  analysis  of  the  data.^^  Thus,  approximate  local  interpretation  strategies  that  do  not 
construct  all  possible  interpretations  may  result  in  a  DRESUN  system  failing  to  detect  the  existence 
of  potential  global  evidential  interactions  (which  might  affect  the  solutions  that  are  chosen). 


5  GSOU  Resolution  Strategies 

While  DRESUN  agents  have  a  representation  of  all  inter-agent  interactions  (for  the  set  of  locally 
created  hypotheses),  we  must  now  consider  what  should  be  done  with  this  information  in  order  to 
develop  global  solutions.  In  other  words,  what  does  resolving  a  GSOU  mean,  what  are  appropriate 
strategies  for  resolving  the  GSOUs,  and  how  do  these  strategies  affect  the  quality  of  the  global 
solutions?  Basically,  resolution  of  a  GSOU  involves  exchanging  information  between  the  associated 
agents  so  as  to  propagate  the  effects  of  each  agent’s  local  evidence  between  the  agents.  An  example 
of  the  resolution  of  a  GSOU  is  shown  in  Figure  3  (the  figure  will  be  discussed  further  below).  Reso¬ 
lution  of  a  GSOU  is  analogous  to  local  (intra-agent)  evidence  propagation  discussed  in  Section  3  in 

While  interactions  could  be  postulated  directly  from  the  data,  this  would  result  in  large  numbers  of  highly 
uncertain  possible  interactions.  For  example,  in  a  large-scale  military  surveillance  system  it  is  possible  for  many 
targets  over  the  entire  area  being  monitored  to  be  acting  in  concert,  as  in  some  attack  scenario.  As  a  result,  a  priori, 
there  is  some  (non-zero)  probability  of  any  two  pieces  of  data  being  interrelated  in  such  a  system. 
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Agent  1 


Agent  2 


Agent  2 


Agent.  1 


incomplete  GSOU  resolution  incomplete  GSOU  resolution 

Figure  3:  Examples  of  resolving  a  global  consistency  SOU  (GSOU). 


connection  with  the  local  interpretation  strategies.  As  in  local  propagation,  resolution  of  a  GSOU 
may  not  be  a  one  shot  process,  but  may  require  periodic  propagation  as  part  of  an  incremental 
interpretation  process.  Thus,  conceptually  we  can  think  of  the  GSOU  resolution  process  as  estab¬ 
lishing  evidential  links  among  the  associated  agents’  hypothesis  networks  (analogous  to  the  links 
in  a  belief  network). 

As  with  the  local  interpretation  process,  there  are  a  range  of  GSOU  resolution  strategies  that 
may  be  used.  These  strategies  differ  in  how  completely  and  accurately  global  interactions  are 
considered,  based  on  different  choices  about  which  global  SO  Us  to  pursue  and  how  completely  to 
assess  the  evidential  effects  of  external  agent  data.  In  considering  what  are  appropriate  GSOU 
resolution  strategies,  we  return  to  the  issue  of  termination  criteria.  A  key  criterion  that  we  wish 
to  impose  on  FA/C  agents  is  that  their  local  solutions  are  “consistent”  at  termination.  This  allows 
the  local  solutions  to  be  merged  into  a  global  solution  without  the  need  for  additional  problem 
solving.  Our  definition  of  global  consistency  of  local  solutions  is  an  evidential  one:  solutions  are 

Since  inter- agent  evidence  propagation  requires  inter- agent  communication,  these  links  are  only  conceptual.  In 
DRESUN,  when  evidence  has  been  communicated  to  an  external  agent  it  is  tagged.  These  tags  alert  the  agent  to 
communicate  later  modifications. 
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consistent  if  their  hypotheses  are  pairwise  identical,  independent,  or  corroborative.  Local  solutions 
are  inconsistent  when  any  of  their  hypotheses  are  contradictory — i.e.,  have  a  negative  evidential 
relationship. 

The  most  comprehensive  resolution  strategy  is  for  all  global  SOUs  to  be  completely  resolved — 
i.e.,  for  all  GSOUs  in  all  agents  to  be  resolved  using  the  entire  set  of  possible  interpretation  hypothe¬ 
ses  constructed  by  the  relevant  external  agents  and  creating  all  possible  interpretation  hypotheses 
from  the  joint  interpretations  (and  their  supporting  data/evidence).  We  will  refer  to  this  as  the 
complete  GSOUs  resolution  strategy.  An  approximate  coordination  strategy  that  meets  the  global 
consistency  criterion  is  for  each  agent  to  resolve  only  those  GSOUs  that  are  associated  with  its 
local  solution  (“best”  interpretation)  and  to  pursue  evidence  propagation  using  only  the  local  solu¬ 
tion  hypotheses  of  the  relevant  external  agents.  We  will  refer  to  this  as  the  local  solutions  GSOUs 
resolution  strategy. 

As  we  noted  earlier,  the  overall  effect  of  these  strategies  is  dependent  on  the  local  interpretation 
strategy  being  used  by  the  agents,  since  resolution  is  based  on  the  interpretation  hypotheses  actually 
constructed  by  the  agents. In  fact,  there  are  other  reasons  why  we  cannot  consider  the  effects  of 
the  global  strategies  in  isolation  from  the  local  interpretation  strategies.  First,  we  must  consider  how 
the  GSOUs  resolution  strategy  interacts  with  the  local  interpretation  strategy  when  the  propagation 
of  evidence  among  the  agents  results  in  an  agent  changing  its  belief  ratings  and/or  creating  new 
hypotheses.  Our  assumption  will  be  that  such  changes  trigger  the  agent  to  revise  its  solution,  using 
the  same  criteria  that  it  had  been  using  for  its  local  interpretation  strategy.  Thus,  an  agent  using 
the  exact  local  interpretation  strategy  will  revise  its  solution  to  be  the  true  MPE  given  its  previous 
evidence  and  its  newly  acquired  evidence.  This  is  a  reasonable  assumption  since  SI  systems  often 
must  be  able  to  incrementally  develop  their  solutions  wfien  they  are  operating  continuously  or 

■a 

providing  users  with  current  interpretations  in  real-time. 

The  second  interaction  issue,  which  is  raised  by  the  need  to  update  the  local  solutions,  is 
what  information  does  the  local  interpretation  strategy  require  be  propagated  between  the  agents. 
The  local  interpretation  strategy  can  have  a  major  impact  on  the  amount  of  information  that 
must  be  transferred  among  the  agents  to  “resolve  GSOUs”  in  the  sense  needed  to  meet  the  global 
consistency  criteria.  For  example,  approximate  local  strategies  may  simply  require  that  global 
resolution  provide  sufficient  information  to  compute  the  revised  hypothesis  beliefs  that  result  from 
any  interactions.  By  contrast,  the  exact  local  interpretation  strategy  has  to  consider  the  joint 
probabilities  of  the  possible  interpretations  since  it  is  to  compute  the  MPE.  In  this  case,  if  we  are 

^^Thus,  a  more  comprehensive  strategy  for  resolving  GSOUs  would  be  one  that  considers  not  just  the  possible 
interpretations  created  by  the  external  agents  but  also  any  of  their  data  that  could  serve  evidence  for  the  local 
hypotheses.  We  will  not  consider  this  alternative — it  completely  confuses  the  effects  of  the  local  and  global  strategies 
because  the  global  strategy  can  effectively  override  the  local  strategy. 
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to  truly  ha>ye  complete  resolution  of  the  GSOUs  then  enough  information  must  be  transferred  to 
allow  each  agent  to  determine  “their  part”  of  the  MPE  given  the  joint  evidence  of  the  agents.  When 
the  MPE  can  be  computed  using  a  local  computation,  message  passing  scheme  (see  the  discussion 
of  belief  revision  in  [12])  this  is  not  a  major  issue.  However,  if  the  agents  require  access  to  aU  of  the 
possible  component  hypotheses  of  the  possible  interpretations  then  substantial  additional  evidence 
may  have  to  be  communicated  (beyond  that  necessary  to  compute  revised  hypothesis  beliefs). 

Figure  3  shows  examples  of  resolving  a  GSOU  for  both  the  exact  local  interpretation  strategy  and 
the  local  solutions  GSOUs  resolution  strategy.  One  thing  to  note  from  this  figure  is  the  effect  that 
global  solution  consistency /inconsistency  has  on  the  amount  of  inter-agent  propagation  that  occurs 
using  the  two  strategies.  As  the  figure  shows,  the  exact  local  interpretation  strategy  performs 
the  same  level  of  inter-agent  evidence  propagation  regardless  of  whether  the  local  solutions  are 
consistent  or  not.  By  constrast,  the  local  solutions  GSOUs  resolution  strategy  is  an  approximate 
'^ategy  because  it  does  not  completely  propagate  the  effects  of  the  agents  evidence  when  the  local 
solutions  are  consistent.  However,  it  performs  exactly  the  same  level  of  propagation  as  the  exact 
local  interpretation  strategy  when  the  local  solutions  are  inconsistent. 

6  Notation 

This  section  defines  the  notation  that  will  used  in  presenting  the  results  and  their  proofs  that 
appear  in  the  following  section.  For  the  centralized  (single-agent)  case,  we  will  use  the  following 
notation: 

•  V  is  the  complete  data  set  available  to  the  system. 

•  Di  denotes  some  subset  of  V  (Di  C  V), 

•  Ci  denotes  an  interpretation  context — i.e.,  a  particular  stage  of  processing.  Each  context 
refers  to  a  specific  data  subset  Di  that  has  been  processed  and  a  particular  set  of  hypotheses 
and  evidential  links  that  have  been  created. 

BEL{hyCi)  denotes  the  true  degree-of-belief  in  hypothesis  h  in  context  Ci — i.e.,  P{h  \  Di), 
where  Di  is  the  data  subset  associated  with  context  Ci,  This  may  not  be  the  belief  rating  ac¬ 
tually  associated  with  h — see  BEL{h,Ci)  below.  Also,  in  general  BEL{h,Di)  7^  BEL{h,V) 
when  Di  C  V,  and  because  interpretation  domains  are  not  typically  monotonic  BEL{h,Di) 
may  be  less  than  or  greater  than  BEL{h,V),  When  the  interpretation  context  is  clear,  we 
will  simply  use  BEL{h), 

•  BEL{h,Ci)  is  the  belief  rating  that  is  actually  associated  with  h  given  the  interpretation 
strategy  being  used.  In  general  BEL{h,  Ci)  ^  BEL{h,  Ci)  unless  the  exact  local  interpretation 

strategy  is  being  used.  As  above,  BEL{h,Ci)  will  be  written  as  BEL[h)  when  the  context  is 
clear. 

•  X{V)  is  the  true  MPE  of  the  complete  data  set  V. 

•  S^{Ci)  is  the  true  MPE  of  the  data  subset  associated  with  context  Ci,  This  may  not  be  the 
solution  that  is  selected  by  the  interpretation  strategy — see  i[Ci)  below. 
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•  is  the  “best  interpretation”  solution  of  the  data  subset  associated' with  context  Ci  given 
the  local  interpretation  strategy  being  used.  We  do  not  specify  how  the  solution  interpretation 
is  selected,  but  it  need  not  be  the  MPE  given  the  approximate  processing  represented  by  Ci 
(the  solution  may  for  instance  have  been  selected  using  acceptance  thresholds). 

For  the  distributed,  multi-agent  case,  modifications  must  be  made  to  account  for  the  distribution 
of  data  and  hypotheses  among  multiple  agents.  In  particular,  the  concept  of  an  interpretation 
context  must  be  extended  for  the  distributed  case  to  deal  with  there  being  multiple  contexts  related 
to  each  agent  and  with  evidence  communication/propagation  among  the  agents  as  part  of  GSOUs 
resolution.  The  following  notation  will  be  used  for  the  distributed  case: 

•  ./I  is  the  set  of  agents  {Ai,^2>  •  •  •}?  with  their  interest  area  specifications. 

•  P*  is  the  complete  data  set  available  (directly)  to  only  agent  Ai — i.e.,  the  complete  data  set 
that  is  available  from  agent  Ai's  own  sensors. 

•  P®  refers  to  the  complete,  globally  available  data  set— i.e.,  the  combined  data  from  all  of  the 
agents,  P*.  P®  =  P  for  an  equivalent  centralized  system. 

AieA 

•  Pj  denotes  some  subset  of  agent  A.-’s  local  data  (Pj  C  P‘). 

•  refers  to  external  data  from  agent  Aj,  d{  ,  that  is  available  to  agent  Ai  because  it  has 
been  communicated  from  Aj  as  part  of  GSOUs  resolution.  Note  that  the  evidential  effects 
of  this  data  may  only  partly  be  known  to  Ai  depending  on  the  local  interpretation  strategy 
being  used  by  Aj. 

•  refers  to  external  data  from  all  the  other  agents,  P[.,  that  is  available  to  agent 

AiMi 

Ai  because  it  has  been  communicated  by  these  agents  as  part  of  GSOUs  resolution. 

•  Cj  denotes  an  interpretation  context  for  agent  Ai.  As  with  Ci  above,  each  context  refers  to  a 
specific  subset  of  the  local  data  Dj  that  has  been  processed  and  a  particular  set  of  hypotheses 
and  evidential  links  that  have  been  created.  In  addition,  it  also  refers  to  any  available  external 
evidence  that  has  been  communicated  from  other  agents  in  the  process  of  resolving  GSOUs 

•  Cf  will  be  used  to  refer  to  a  global  context — i.e.,  Cf  =  [J  Gj. 

AieA 

•  BEL'{h,Cj)  denotes  the  true  degree-of-belief  in  hypothesis  h  in  agent  A,’s  context  Cj.  It 
is  just  P{h  I  D^j,Df~*').  This  may  not  be  the  belief  rating  that  agent  A,-  actually  associates 

with  h  see  BEL  (h,Cj)  below.  When  the  interpretation  context  is  clear,  we  will  simply  use 
BEL\h). 

•  BEL  (h,Cj)  is  the  belief  rating  that  agent  A,-  actually  associates  with  h  in  context  Cj  given 
the  interpretation  strategy  being  used.  BEL  {h,Cj)  ^  BEL’{h,Cj)  unless  the  agents  are  us¬ 
ing  the  exact  local  interpretation  strategy.  As  above,  ^L\h,Cj)  will  be  written  as  ^L\h) 
when  the  context  is  clear. 

•  X\Cj)  is  the  MPE  for  the  data  subset  associated  with  context  Cj. 

•  5s  agent  A,’s  “best  interpretation”  solution  of  the  data  subset  associated  with  context 
gi'ven  the  local  interpretation  and  global  resolution  strategies  being  used.  As  in  the 
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re”lhfwF“'’  ’'".k  °  **"  is  selected  but  it  need  not 

be  the  MPE  given  the  approximate  processing  represented  by  Cj. 

•  ^“(Cy)  denotes  the  combined,  global  “best  interpretation”  solution  of  the  data  subset  as¬ 
sociated  with  the  global  context  Cf.  That  is,  I=(Cy)  =  U  i‘(Cy).  is  defined 

that  satisfies  the  consistency 

7  Solution  Quality  Results 

In  this  section  we  will  examine  the  quality  of  solutions  that  can  be  produced  using  an  FA/C 
approach,  relative  to  those  that  could  be  produced  using  an  equivalent  centralised  approach  To 
rephr^e  the  main  question  we  are  interested  in  addressing:  what  is  the  relationship  between 

I  (C,J  and  I(C,J,  where  C£  and  Cy.  represent  possible  final  contexts  (contexts  that  satisfy  the 
termination  criteria)  for  the  distributed  and  centralised  cases,  respectively? 

7,1  Theorem  1 

In  this  section  we  will  examine  the  quality  of  FA/C  global  solutions  assuming  that  the  centralised 

■  c  nd  the  dtstributed  agents  use  the  ciucf  local  interpcetation  atrateg,  to  process  their  local 

eta  and  that  the  distributed  agents  use  the  complete  GSOUs  resolution  strategy  (the  strategies 
are  described  in  Sections  3  and  5). 

Theorem  1:  «.en  a  centralized  system  that  uses  the  exact  local  interpretation  strategy  a  set 
of  agents  A  each  of  which  uses  the  exact  local  interpretation  strategy  and  the  complete  GSOUs 
resolutton  strategy,  VP  and  V{p.)  with  P«  =  p  (all  pgssihle  diuisions  of  the  data  among  the 

agenu),  interpmtation 

Strategyaad  the  complete  GOVs  resolution  strategy,  the  FA/0  distributed  system  produces  the  same 
exact  mterpretatron  as  the  centraUsed  system,  which  is  the  true  MPE  of  aU  the  globally  available 

Proof:  Clearly  T(Cy.)  =  T(P)  by  the  definition  of  the  erect  iocef  interpretation  strategy  and 
the  fac^t  that  the  centraUsed  system  (agent)  has  access  to  the  complete  data  set  P.  To  show 

that  I<^(C£)  =  I(P)  we  must  show  that  if  h  6  i«(C«)  then  h  £  T(P)  and  if  h  £  I(P)  then 

h  £  I=(Cg).  Suppose  that  h  £  i«(C°).  This  means  that  there  is  at  least  one  A,  £  X  such  that 

he  X’jC’jJ,  We  specified  that  when  using  these  two  strategies  in  conjunction,  enough  evidential 

i'iCfj  since  in  the  final  context  ind'  ./  f  f  ^"dividual  agent  interpretations 

/a;  context  individual  agent  solutions  must  be  consistent  (mergeable). 
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information  must  be  transferred  among  the  agents  so  that  the  local  agent  solutions  are  not  only 
consistent,  but  together  represent  the  MPE  given  the  evidence  based  on  their  local  hypotheses.  If 
we  assume  that  every  agent  creates  every  possible  interpretation  of  its  data  as  part  of  the  exact 
local  interpretation  strategy,  then  this  will  mean  that  every  possible  inter-agent  evidential  relation 
will  be  represented  by  some  GSOU  and  so  will  be  resolved.  Thus,  in  a  final  context  where  all 
GSOUs  have  been  completely  resolved  and  exact  local  interpretations  have  been  computed,  h  must 
be  in  I(X>),  the  MPE  given  the  globally  available  data.  Now,  suppose  that  h  G  T(T>).  If  we  again 
assume  that  agents  are  creating  every  possible  interpretation  hypothesis  as  part  of  the  exact  local 
interpretation  strategy,  h  will  have  been  created  by  at  least  one  agent  Ai.  There  are  two  cases 
to  consider:  ,1.)  BEL\h,V^)  =  BEL{h,V)  and  2.)  BEL\h,V^)  BEL{h,V).  In  the  first 
case,  the  belief  in  h  is  independent  of  the  data  in  all  external  agents.  This  means  that  Ai  can 
locally  assess  the  belief  in  h  and  whether  it  is  part  of  the  MPE  without  any  resolution  of  GSOUs. 
Thus,  if  h  G  T(T>)  then  h  G  and  so  h  £  (by  the  global  consistency  criteria).  In 

the  second  case,  Ai  cannot  locally  assess  the  belief  in  h.  However,  if  we  assume  that  all  possible 
interpretation  hypotheses  are  created  and  aU  GSOUs  are  completely  resolved,  then  agent  Ai  must 
have  the  information  necessary  to  assess  both  the  belief  in  h  and  whether  it  is  in  the  MPE.  Thus 
again,  if  /i  G  T('D)  then  h  G  and  so  h  £  (by  the  global  consistency  criteria). 

This  result  is  what  we  would  expect  given  the  definitions  of  the  exact  local  interpretation  strategy 
and  the  complete  GOUs  resolution  strategy,  and  given  the  assumptions  inherent  in  the  DRESUN 
model  (that  the  GSOUs  represent  all  possible  interactions  for  the  set  of  created  hypotheses).  How¬ 
ever,  it  serves  to  formally  establish  the  theoretical  capabilities  of  a  DRESUN-based  FA/C  dis¬ 
tributed  SI  system.  Other  FA/C  distributed  SI  architectures  have  not  had  this  property  because 
they  have  lacked  a  representation  of  all  the  inter-agent  interactions  and  a  mechanism  for  controlling 
their  resolution.  This  result  shows  that  an  incremental,  distributed  approach  to  SI  need  not  produce 
poorer  quality  solutions  than  a  centralized  system.^^  Furthermore,  the  result  is  not  trivially  obvi¬ 
ous.  We  had  to  make  the  assumption  that  agents  create  all  possible  interpretation  hypotheses  and 
we  also  had  to  be  very  careful  in  how  we  defined  the  interactions  between  the  local  interpretation 
and  GSOUs  resolution  strategies  for  the  result  to  hold. 

Unfortunately,  the  practical  implications  of  this  result  are  very  limited  since  few  SI  systems 
(centralized  or  distributed)  can  afford  to  do  exact  local  interpretation  and  complete  GSOUs  reso¬ 
lution.  The  only  time  such  a  situation  would  be  practical  would  be  if  enough  agents  are  used  so 
that  each  agent’s  local  processing  requirements  are  adequate  to  completely  and  exactly  process  its 

We  assumed  that  both  systems  had  access  to  exactly  the  same  data.  This  is  reasonable  if  only  passive  sensors 
are  being  used  or  if  the  SI  is  not  being  done  in  real-time,  but  is  much  less  reasonable  once  active  sensors  can  be  used 
in  real-time  SI. 
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local  data  and  if  the  global  interactions  are  limited  in  both  number  and  scope.  In  other  words,  if 
few  GSOUs  are  produced  and  those  that  are  can  be  resolved  with  limited  processing.  This  latter 
point  is  important,  since  even  if  there  are  few  GSOUs,  it  may  be  very  costly  to  completely  resolve 
every  one  if  they  can  each  potentially  interact  with  large  numbers  of  interpretation  hypotheses  in 
the  associated  external  agent(s). 

It  is  easy  to  show  that  the  use  of  approximate  local  interpretation  strategies  or  an  incomplete 
GSOUs  resolution  strategy  results  in  the  systems  failing  to  be  guaranteed  of  producing  the  MPE. 
However,  if  we  accept  that  even  centralized  SI  systems  will  have  to  settle  for  sub-optimal  solutions, 
then  what  we  are  reaUy  interested  in  determining  is  whether  distributed  systems  can  produce 
solutions  that  are  equivalent  with  (or  at  least  sufficiently  close  to)  those  produced  by  approximate 
centralized  systems,  and  whether  they  can  do  it  efficiently.^^  It  certainly  is  possible  to  come 
up  with  approximate  local  interpretation  strategies  for  an  approximate  FA/C-based  SI  system  to 
produce  solutions  that  are  equivalent  to  those  of  an  approximate  centralized  system.  However, 
this  depends  on  the  details  of  the  approximate  interpretation  strategy  since  there  are  only  certain 
approximation  strategies  whose  effects  can  be  duplicated  by  a  GSOUs  resolution  strategy  (the 
agents  in  the  distributed  system  wiU  stiU  have  limited  views  of  the  globally  avaiable  raw  data,  for 
instance).  In  the  remaining  results  subsections,  we  wiU  begin  to  consider  approximate  interpretation 
and  coordination  strategies. 

7.2  Theorem  2 

We  are  not  yet  at  the  point  where  we  can  formally  define  and  analyze  the  effects  of  arbitrary  inter¬ 
pretation  and  coordination  strategies.  However,  in  this  section  we  will  undertake  some  evaluation  of 
the  effects  of  the  local  solutions  GSOUs  resolution  strategy.  This  is  an  imi)ortant  strategy  because 
it  can  be  very  efficient  when  local  agent  solutions  are  consistent.  In  addition,  the  developers  of 
the  FA/C  paradigm  clearly  saw  this  as  a  useful  coordination  strategy.  For  example,  [8]  refers  to 
consistency  checking  of  the  tentative  local  solutions  with  results  received  from  other  nodes  as  “an 
important  part  of  the  FA/C  approach.” 

Theorem  2:  Given  a  centralized  system  that  uses  an  arbitrary  (possibly  approximate)  local 
interpretation  strategy  and  a  set  of  agents  A.  each  of  which  uses  this  same  local  interpretation 
strategy  and  the  local  solutions  GSOUs  resolution  strategy,  3P  and  3{V^}  with  —  V  (some 

To  clarify  this  discussion,  we  should  point  out  that  a  distributed  SI  system  does  not  necessarily  do  more  “work” 
than  the  centralized  system  does  to  get  the  MPE  solution.  Completely  resolving  the  GSOUs  results  in  the  distributed 
system  performing  precisely  the  same  evidential  propagation  as  would  have  to  be  performed  by  a  centralized  system. 
Of  course,  doing  this  propagation  via  inter-agent  communication  will  be  more  costly  both  in  terms  of  processing  and 
elapsed  time  than  it  would  be  in  a  centralized  system— that  is  the  issue.  The  basic  complexity  of  SI  is  determined 
by  the  interrelatedness/independence  of  the  possible  data  interpretation  hypotheses,  though. 
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division  of  the  data  among  the  agents)^  such  thatX^{C^^  7^  andi^(Cfj  ^  In  other 

words,  the  local  solutions  GSOUs  resolution  strategy  does  not  guarantee  that  the  FA/C  distributed 
system  produces  the  same  exact  interpretation  as '  the  centralized  system  or  the  true  MPE  of  all 
the  globally  available  data. 

Proof:  Regardless  of  the  local  interpretation  strategy  being  used,  there  must  be  some  data 
and  some  distribution  of  that  data  so  that  there  is  some  hypothesis  h  such  that  h  e  V{V^)  and 
h  E  but  not  h  E  X{Cjf).  The  only  case  in  which  this  would  not  be  true  would  be  a  domain 

in  which  there  was  complete  solution  monotonicity — i.e.,  VjD^  cX),\ih  E  X{Di)  then  h  E  X{V). 

For  example,  it  is  entirely  possible  to  have  P{Ha  1  T^i)  >  F(^6  |  Di)  and  P{Ha  \  D2)  >  P{Hh  \ 
£^2)5  but  P{Ha  I  Di,D2)  <  P{Hh  I  jDi,jD2)  (where  Ha  and  are  competing  hypotheses  and  Di 
and  D2  are  data  sets  in  different  agents).  In  other  words,  the  solution  that  is  locally  most  likely 
in  both  agents  {Hq)  may  not  be  the  globally  most  likely  solution — even  though  the  local  solutions 
are  consistent.  This  is  simply  a  consequence  of  the  nonmonotonicity  inherent  in  reasoning  based 
on  incomplete  and  uncertain  information. 

7.3  Approximate  Monotonicity 

Because  “consistency”  of  local  solutions  does  not  in  general  guarantee  that  the  merged,  global 
^^dntion  is  the  best  solution,  the  local  solutions  GSOUs  resolution  strategy  not  produce  solutions 
0  equivalent  to  an  approximate  centralized  system,  let  alone  the  MPE.  This  is  unfortunate 
since  this  coordination  strategy  can  be  very  efficient.  It  also  conflicts  with  the  intuitions  of  ourselves 
and  other  other  FA/C  researchers  that  this  is  a  reasonable  approximate  FA/C  coordination  strategy. 

We  believe  that  the  reason  for  this  apparent  contradiction  is  that  many  real  world  domains  have 
characteristics  that  allow  the  this  strategy  to  work  quite  well.  For  example,  in  vehicle  monitoring, 
a  considerable  amount  of  evidence  must  typically  be  acquired  to  achieve  a  vehicle  track  hypothesis 
with  high  belief.  While  additional  evidence  can  negatively  affect  the  belief  in  such  a  track,  it  will 
take  substantial  additional  evidence  to  have  a  major  effect  on  the  belief.  In  other  words,  while  belief 
is  nonmonotonic  with  increasing  evidence,  in  some  situations  the  effects  of  additional  evidence  are 
not  totally  unpredictable. 

What  this  observation  means  for  the  “local  solutions  strategy”  is  that  if  a  track  hypothesis  is 
part  of  a  local  solution,  has  “fairly  high”  belief,  and  is  globally  consistent,  then  it  is  pretty  likely 
that  the  track  is  correct.  We  will  refer  to  this  characteristic  as  approximate  monotonicity.  We  have 
formalized  this  notion  in  terms  of  probability  distributions  for  the  correct  global  probability  of  a 
hypothesis  in  terms  of  its  belief  based  on  local  agent  data  only.  A  key  focus  of  our  future  research 
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will  be  to  use  this  concept  to  design  a  coordination  strategy  whose  effects  are  predictable  in  terms 
of  the  probability  of  its  global  solution  being  optimal. 

7.4  Theorem  3 

In  this  section  and  the  next,  we  will  consider  the  use  of  another  approximate  GSOUs  resolution 
strategy,  one  that  is  incremental  in  the  sense  that  the  completeness  with  which  the  GSOUs  are 
resolved  is  not  prespecified  but  depends  on  the  situation.  The  termination  criteria  will  now  be 
phrased  in  terms  of  whether  a  solution  is  acceptable  or  not.  We  define  the  predicate  acceptable{X{C)) 
to  mean  that  V/i  G  1(0)^  BEL(h)  >  AT,  where  AT  is  the  acceptance  threshold  {Q  <  AT  <  1).  Based 
on  this  definition  we  define  the  incremental  GSOUs  resolution  strategy  as:  each  agent  will  resolve 
the  GSOUs  as  in  the  local  solutions  GSOUs  resolution  strategy^  but  if  its  resulting  local  solution 
is  not  acceptable  it  must  continue  to  resolve  global  SOUs  associated  with  the  hypotheses  in  its 
solution  until  the  solution  is  judged  acceptable  or  all  the  relevant  global  SOUs  have  been  resolved 
(i.e.,  as  when  using  the  complete  GSOUs  resolution  strategy). 

In  this  section  we  will  assume  that  the  agents  use  the  exact  local  interpretation  strategy.  For 
the  centralized  case,  this  means  that  the  final  context  refers  to  the  complete  data  set  P,  and 

that  T{Cj^)  =  I{V),  Because  we  are  now  considering  a  satisficing  approach  to  solution  selection, 
instead  of  comparing  the  centralized  and  distributed  solutions  based  on  whether  they  are  idential, 
we  will  judge  them  based  on  whether  they  are  acceptable. 

Theorem  3:  Given  the  exact  local  interpretation  strategy  and  the  incremental  GSOUs  reso¬ 
lution  strategy,  VP  if  acceptable{i{CfJ)  then  acceptable{i^(Cfj),  where  and  Cf^  are  final 
contexts  that  meet  the  specified  centralized  and  distribute^  termination  criteria  for  the  P,  respec¬ 
tively.  In  other  words,  if  a  centralized  SI  system  is  able  to  produce  an  Acceptable  solution  for  a 
data  set  then  so  should  a  distributed  DRESUN  system  using  the  specified  strategies. 

Proof;  Assume  that  the  theorem  is  false.  This  means  that  even  though  acceptable{i{C 

E  such  that  BEL^ {h,Cj^)  <  AT.  Now,  because  acceptable{i{C jf)),  3{hj}  C  P(C/J 

such  that  explains*{support-of*{h),{hj})  and  \/hj  G  {hj},BEL{hj)  >  AT.  In  other  words,  the 
centralized  system  has  one  or  more  hypotheses  that  are  part  of  its  solution,  that  have  acceptable 
belief  ratings,  and  that  explain  the  data  underlying  the  unacceptable  hypothesis  h  in  the  global 
interpretation.  Since  these  hypotheses  are  alternative  explanations  for  support- of*{h),  each  of  these 
hypotheses  would  also  have  been  created  by  at  least  one  of  the  agents  that  produced  h.^^  So,  for 

Actually,  the  exact  same  hypothesis  may  not  have  been  produced  by  a  single  agent  since  each  of  the  agents  may 
have  only  a  portion  of  the  data  necessary  to  produce  the  version  of  the  hypothesis  with  the  most  complete  support 
possible.  This  issue  arises  because  interpretation  problems  are  not  propositional,  so  as  additional  evidence  is  generated 
for  a  hypothesis  the  evidence  both  modifies  the  hypothesis’  belief  and  refines  the  values  of  its  attributes — creating 
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Agent  3 


Agent  4 


Figure  4:  An  example  of  the  effect  of  partial  resolution  of  the  global  SOUs. 

Here,  resolution  of  a  GSOU  btween  agents  2  and  3  produced  the  hypothesis  /i,  which  resulted  in  {/i, being 
judged  as  an  acceptable  explanation  for  agents  2  and  3,  Because  of  this,  the  GSOUs  associated  with  h\  and  A2  were 
not  resolved.  However,  resolution  of  these  GSOUs  could  result  in  corroborating  evidence  being  obtained  from  agents 
1  and  4,  which  might  make  the  best  explanation. 


each  hj  there  is  a  corresponding  hypothesis  that  would  have  been  created  by  Now,  why  is 
_ _ ^ 

BEL  (/ly)  <  AT  when  BEL{hj)  >  AT?  There  are  three  possibilities:  (1)  there  exists  data  in  one 
or  more  agents  other  than  Ak  that  would  corroborate  but  this  data  was  not  propagated  to 
(2)  there  exists  data  in  one  or  more  other  agents  that  would  contradict  one  or  more  alternatives 
to  (3)  both  of  the  above  may  be  true.  Consider  case  1:  If  there  exists  data  in  other  agents 
that  could  support  /ij,  then  Ajt  would  have  had  GSOUs  associated  with  hj  that  identified  the 
possible  existence  of  this  data  (since  GSOUs  identify  all  interrelationships  among  the  agents  given 
the  constructed  hypotheses),  but  these  SOUs  were  not  resolved.  This  leads  to  a  contradiction  with 
our  specified  coordination  strategy,  however,  since  these  SOUs  should  have  been  pursued  when  no 
acceptable  explanation  was  found  for  the  data  common  to  h  and  hj  (remember  that  h  G  T^(C^) 

but  BEL  (/i,C^)  <  AT).  Cases  2  and  3  lead  to  similar  contradictions. 

Thus,  given  the  specified  strategies  if  a  centralized  interpretation  system  can  find  an  interpreta¬ 
tion  above  a  particular  threshold  then  so  can  a  DRESUN-based  distributed  interpretation  system. 
This  is  a  somewhat  weak  result,  but  it  serves  to  establish  a  basic  capability  of  the  DRESUN  archi¬ 
tecture.  Other  FA/ C  distributed  problem-solving  architectures  have  not  had  this  property  because 
they  have  lacked  a  representation  of  the  inter-agent  constraints. 

It  is  important  to  note  that  while  we  have  shown  that  a  distributed  DRESUN  system  would 
find  an  ‘“acceptable”  solution,  this  interpretation  may  not  be  identical  to  that  produced  by  the 

new  versions  of  the  hypothesis  (what  [2]  terms  extensions).  Thus,  what  we  really  mean  is  that  a  precursor,  partial 
version  of  each  alternative  hypothesis  would  have  been  created  by  one  or  more  of  the  agents.  This  is  true  because 
interpretation  hypotheses  can  be  created  based  on  incomplete  support,  and  our  specified  strategy  was  for  each  agent 
to  create  all  the  possible  interpretations  that  are  (even  partially)  supported  by  its  own  local  data. 

The  data  cannot  be  in  Ak  since  we  have  assumed  complete  intra-agent  propagation. 
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centralized  system  and  may  not  be  the  MPE  of  the  global  data  set.  Because  an  acceptable  solution 
might  be  found  without  all  the  GSOUs  having  been  resolved,  there  could  have  been  incomplete 
evidence  propagation  among  the  agents.  As  a  result,  hypothesis  belief  ratings  would  be  only 
approximations  of  the  true  conditional  probabilities  and  this  could  lead  to  alternative  explanations 
being  (incorrectly)  rated  more  highly— see  Figure  4.  This  is  the  reason  that  the  proof  used  BEL{h) 

^ ^  -  Q 

for  the  centralized  system,  but  BEL  (h.)  for  the  distributed  system.  As  Theorems  1  and  2  show, 
the  only  way  to  guarantee  that  a  distributed  system  produces  a  solution  that  is  identical  to  that  of 
the  centralized  system  (and  is  the  MPE  of  the  globally  available  data)  is  to  require  that  all  of  the 
GSOUs  associated  with  every  hypothesis  in  each  agent  be  completely  resolved. 

7.5  Theorem  4 

In  this  section  we  wiU  again  examine  the  nature  of  global  solutions  when  using  the  incremental 
GSOUs  resolution  strategy.  However,  unlike  in  Theorem  3,  here  we  will  assume  that  the  agents  use 
an  approximate  local  interpretation  strategy.  This  means  that  agents  do  not  necessarily  construct 
all  possible  interpretation  hypotheses  and  so  may  compute  only  approximate  belief  ratings  (condi¬ 
tional  probabilities)  for  the  hypotheses.  Analyzing  the  solution  quality  given  incomplete  evidence 
propagation  is  complicated  by  the  fact  that  a  system’s  solutions  can  depend  on  the  details  of  the 
strategies  being  used.  We  are  not  yet  at  the  point  where  we  can  formally  define  and  analyze  the 
effects  of  different  strategies.  Thus,  we  will  not  specify  a  particular  strategy  for  controlling  evidence 
propagation  here.  Instead,  we  will  simply  assume  that  the  same  strategy  is  used  in  the  single  agent 
of  the  centralized  case  and  in  all  of  the  agents  of  the  distributed  case.  We  wiU  continue  to  assume 
that  all  of  the  data  directly  available  to  each  agent  is  (at  least  partially)  processed  by  that  agent. 

Theorem  4:  Given  an  approximate  local  interpretation  strategy  and.the  incremental  GSOUs 
resolution  strategy,  it  is  not  the  case  that,  VP  if  acceptable{i{C fj)  then  acceptable{i°{Cfj), 

where  and  Cf^  are  final  contexts  that  meet  the  specified  centralized  and  distributed  termination 
criteria  for  the  data  set  V,  respectively.  In  other  words,  even  if  a  centralized  system  is  able  to 
produce  an  acceptable  solution  for  a  data  set,  a  distributed  DRESUN  system  may  not  (even  when 
the  same  local  interpretation  strategy  is  being  used). 

Proof:  Assume  that  the  theorem  is  false:  VP  acceptable{X{C jfif)  implies  acceptable{i^ 

It  is  easy  to  show  the  general  existence  of  counter  examples  (the  counter  example  is  illustrated 
in  Figure  5).  Suppose  that,  the  centralized  system,  which  has  a  view  of  the  complete  data  set 
Pi  U  P2  U  P3  U  P4,  initially  chooses  to  construct  hypotheses  hi  and  /12.  If  this  is  an  acceptable 
explanation  {BEL{hi)  a'Hd  BEL{h2)  >  AT)  no  further  propagation  may  be  done.  In  the  distributed 
case,  the  agents  might  initially  produce  the  hypotheses  h^  and  h4  (based  on  their  own  local  data). 
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Centralized 


Distributed 


Figure  5:  A  counter  example  for  the  proof  of  theorem  4. 


In  general,  there  will  certainly  exist  data  scenarios  for  which  this  interpretation  would  not  be 
acceptable.  However,  even  if  the  agents  communicate  and  construct  hi  and  /12,  BEL(hi)  and 
BEL{h2)  may  be  different  than  the  approximate  values  computed  in  the  centralized  case,  and 
could  in  fact  lead  to  this  alternative  interpretation  also  being  judged  unacceptable. 

The  reason  that  an  acceptable  centralized  solution  does  not  guarantee  an  acceptable  distributed 
solution  is  that  the  division  of  data  among  the  distributed  agents  may  lead  to  the  construction  of 
a  different  set  of  hypotheses  from  those  constructed  by  the  centralized  system.  The  construction 
of  different  hypotheses  produces  different  belief  approximations.  This  could  cause  the  distributed 
system  to  produce  a  different  acceptable  solution  or  it  could  cause  it  to  fail  to  find  any  acceptable 
solution.  Even  if  the  distributed  system  eventually  constructs  the  same  interpretation  hypothe¬ 
ses  as  the  centralized  system,  it  may  have  done  additional  evidence  propagation  that  causes  the 
interpretation  to  be  judged  unacceptable.  Furthermore,  just  because  the  distributed  system  may 
be  driven  to  do  more  evidence  propagation  than  the  centralized  system,  this  does  not  mean  that 
its  best  solution  is  more  likely  to  be  the  (true)  global  MPE  (as  long  as  its  propagation  is  stiU 
incomplete). 

7.6  Theorem  4  Bounds 

Another  way  of  stating  theorem  4  is  that  when  approximate  GSOUs  resolution  strategies  are  used, 
there  is  merely  some  probability  of  a  distributed  system  producing  an  acceptable  solution  when 
an  acceptable  solution  is  produced  by  an  equivalent  centralized  system.  This  probability  is  a 
function  of  the  interpretation  and  coordination  strategies  being  used,  the  acceptance  threshold. 
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the  characteristics  of  the  domain,  and  the  organization  of  agent  interest  areas.  Understanding 
the  effects  that  these  factors  have  on  the  probability  of  achieveing  comparable  global  solutions  is 
a  major  focus  of  our  future  research.  As  a  first  step,  we  have  explored  how  to  decompose  this 
probability  so  that  we  could  determine  bounds  on  its  components  based  on  reasonable  probabilistic 
characterizations  of  a  control  strategy  and  domain. 

So  what  we  want  to  determine  is:  P(acceptable{X^ \  acceptable{I{C fj))^  where  Cj^  and 
are  final  contexts  that  meet  the  specified  centralized  and  distributed  termination  criteria  for 
the  I>,  respectively.  (For  simplicity,  we  will  use  for  X^{C^^)  and  X^  for  X{CfJ  from  this  point 
on.)  One  way  to  decompose  the  probability  is: 

P{acceptable(X^)  \  acceptabl€{X^))  = 

P(acceptable{X^)  |  accept able{X^)  AX^  =  X^)  +  P{X^  —  X^  \  accept able{X^))  + 
P{acceptable{X^)  \  accept able{X^)  AX^  ^  X^)  *  P{X^  ^  X^  |  acceptable{X^)) 

Here,  we  have  decomposed  the  situation  based  on  whether  the  distributed  system  finds  that  the 
most  likely  interpretation  is  the  same  interpretation  as  was  found  by  the  centralized  system  or 
not  (remember  that  though  the  distributed  system  may  have  produced  the  same  interpretation 
as  the  centralized  system  it  may  have  done  different  propagation,  so  its  belief  ratings  for  the 
hvuotheses  of  the  interpretation  may  be  different  from  those  of  the  centralized  system).  Based  on 
tiiis  decomposition,  it  is  possible  to  identify  some  bounds  on  the  component  probabilities. 

Consider  the  first  conditional  probability  term  in  the  decomposition: 

P{acceptable{X^)  |  accept able{X^)  A  X^  =  I^)  > 

P(V/ii  6  X^,BEL{hi)  >  AT  \  acceptable{X^)) 

Proof:  To  prove  this,  we  need  to  show  that  accept able{X^)  A  (Vfii  G  X^^BEL{hi)  >  AT)  im¬ 
plies  acceptable{X^)  (when  X^  =  J^).  Assume  that  this  is  false,  so  that  accept able{X^)  A 
(Vfi,-  G  X^ ^  BEL{hi)  >  AT)  A  ^ accept able{X^).  Based  on  the  specified  propagation  strategy, 
-~^acceptable{X^)  would  cause  the  distributed  system  to  do  further  evidence  propagation  un¬ 
til  either  acceptable(X^) — which  would  contradict  the  assumption — or  complete  propagation  has 
been  done  for  all  hypotheses  in  X^  that  are  not  acceptable — which  also  leads  to  a  contradic¬ 
tion,  as  we  will  show.  The  complete  propagation  situation  means  that  G  X^  such  that 

BEL  (hi)  <  AT ^  BEL  (hi)  ~  BEL^(hi),  However,  this  contradicts  the  premise  assumptions 
that  Mhi  G  BEL(hi)  >  AT  since  X^  =  X^  here. 
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P{yhi  E  1^5  BEL{hi)  >  AT  |  acc€ptable{I^))  is  the  probability  that  the  hypotheses  in  the  cen- 
tralized  interpretation  are  “acceptable”  given  these  hypotheses  true  belief  ratings  whenever  the  par- 
tial  propagation  strategy  finds  that  the  hypotheses  are  acceptable  given  the  approximate  beliefs. 
This  probability  could  be  determined  experimentally  for  domains  in  which  complete  propagation 
is  at  least  possible  (even  if  it  is  not  possible  for  real-time  problem  solving),  and  could  be  exper¬ 
imentally  estimated  for  domains  in  which  complete  propagation  is  absolutely  intractable.  It  may 
also  be  possible  to  directly  derive  this  probability  from  appropriate  domain  and  strategy  models. 
This  would  be  easier  than  trying  to  derive  P {accept able{I^)  \  accept able{i^)  A  =  1^)  since  it 
would  not  require  modeling  the  relationship  between  the  organization  of  agent  interest  areas  and 
the  uncertainty /incompleteness  of  the  locally  available  data,  and  how  such  conditions  interact  with 
a  particular  control  strategy. 

For  the  corresponding  conditional  probability  term  in  which  there  is  the  following 

bound: 


P{acceptable{i^)  |  acceptable{i^)  Al^  ^  1^)  > 

/  i^)  ^  {\fhi  e  Ij,  BEL{hi)  >  AT)  I  acceptable{i^)  A  Ij  1^) 

Proof:  The  proof  is  analogous  to  the  last  proof. 

^  (Vhi  G  Ij^BEL{hi)  >  AT)  \  acceptable{T^)  A  Ij  ^  X^)  is  the  probability 
that  all  interpretations  of  the  data  other  than  the  most  likely  interpretation  have  hypotheses  whose 
true  belief  ratings  would  make  them  “acceptable”  interpretations,  given  that  the  most  likely  inter¬ 
pretation  has  been  found  to  be  acceptable  using  a  partial  propagation  strategy.  While  this  does 
provide  a  lower  bound  on  the  component  conditional  probability,  it  is  a  weak  bound  because  it  is 
unlikely  to  be  very  large  for  reasonable  values  of  the  acceptance  threshold  AT.  In  the  future  we  will 
be  working  to  improve  this  bound  by  exploring  how  different  strategies  affect  the  likelihood  of  locat¬ 
ing  acceptable  interpretations  when  such  interpretations  exist.  This  will  also  assist  in  characterizing 
the  relative  likelihood  that  the  distributed  system  will  find  the  same  or  different  interpretations 
than  the  centralized  system — a  key  issue  in  assessing  the  target  conditional  probability. 

8  Conclusions 

This  paper  began  with  a  substantial  discussion  of  SI  and  the  kinds  of  interpretation  and  coordination 
strategies  that  can  be  used  for  FA/C  distributed  SI.  Several  basic  results  about  the  quality  of 
solutions  that  can  be  produced  by  an  FA/C-based  distributed  SI  system  were  then  presented. 
First,  while  it  is  possible  for  a  distributed  SI  system  to  produce  the  same  solution  as  a  centralized 
system  (and  the  true  MP E  of  the  available  data)  this  requires  strategies  that  are  typically  impossible 
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for  real-world  SI.  Second,  while  coordination  that  focuses  only  on  local  solutions  has  been  popular 
in  FA/C  systems  and  can  be  efficient,  it  is  not  guaranteed  to  produce  centralized-equivalent  or 
optimal  solutions.  Finally,  if  a  satisficing  approach  to  constructing  solutions  is  adopted  then  an 
FA/C-based  system  can  produce  results  that  are  '‘comparable”  to  those  of  a  centralized  system, 
but  again  only  under  conditions  that  are  typically  impractical. 

Our  future  research  will  concentrate  on  pursuing  the  issues  that  appear  in  Sections  7.3  and  7.6. 
This  is  will  involve  characterizing  the  effects  of  practical  approximate  FA/C  strategies  for  SI,  based 
on  probabilistic  models  of  the  domains.  We  believe  that  results  based  on  the  notion  of  approximate 
monotonicity  will  support  some  of  the  intuitions  that  people  have  had  about  approximate  FA/C- 
based  SI:  certain  types  of  hypotheses  are  better  to  construct  and  communicate,  and  it  is  useful  to 
achieve  a  certain  local  level  of  belief  in  hypotheses  before  communicating  them.  In  each  of  these 
rases  we  see  that  the  likelihood  of  global  solutions  being  correct  when  approximate  strategies  are 
used  wiU  be  improved. 

As  we  pointed  out  in  the  introduction,  different  coordination  strategies  for  how  and  when  to 
communicate  the  information  necessary  to  resolve  a  GSOU  can  greatly  affect  the  efficiency  of  the 
process.  This  issue  is  related  to  the  question  of  local  and  global  strategy  interactions.  For  example, 
suppose  that  an  agent  wants  to  resolve  a  certain  GSOU,  but  the  appropriate  external  agent  has 
not  yet  processed  (interpreted)  the  data  and  created  the  hypotheses  necessary  to  do  this.  Should 
the  initiating  agent  wait  for  the  external  agent  to  get  around  to  doing  this?  Should  it  instead  be 
sent  the  raw  data  and  do  with  it  what  it  wants,  even  if  that  might  produce  a  different  result  than 
would  have  been  produced  in  the  first  scenario?  Since  our  focus  in  this  paper  is  on  solution  quality 
that  can  attained  by  FA/C  distributed  SI  systems,  we  tried  to  separate  the  effects  of  the  local  and 
global  strategies  as  much  as  possible.  In  practice,  however,  their  interactions  can  have  a  profound 
effect  on  efficiency  and  solutions.  These  are  other  issues  that  we  will  be  pursuing  as  part  of  our 
efforts  to  analyze  the  FA/C  model. 
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Abstract 

This  paper  presents  the  IDP  formalism  for  representing  and  analyzing  complex 
interpretation  problem  domains.  The  formalism  is  being  used  to  analyze  the 
relationship  between  the  performance  of  search-based  interpretation  problem 
solving  systems  and  the  inherent  properties,  or  structure,  of  problem  domains 
in  which  they  are  applied.  Models  built  using  this  formalism  can  be  used  for 
describing,  predicting  and  explaining  the  behavior  of  interpretation  systems 
and  for  generalizing  a  specific  problem  solving  architecture  to  other  domains. 
Examples  demonstrate  how  domain  phenomena  such  as  noise  and  missing 
data  are  represented,  how  non-local  relationships  between  subproblems  can  be 
modeled,  and  how  quantitative  properties  of  search  spaces  can  be  calculated. 
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1  Introduction 


In  this  paper,  we  present  formal  methods  for  characterizing  the  structure  of  inter¬ 
pretation  problem  domains’  and  their  associated  problem  solving  architectures.  Our 
primary  assumptions  are  that  the  performance  of  an  interpretation  problem  solver 
applied  in  a  specific  domain  is  a  function  of  the  structure  of  the  problem  domain  and 
that  effective  problem  solving  techniques  can  be  derived  from  a  formal  specification 
of  a  problem  domain.  We  define  interpretation  problems  as  a  search  process  where, 
given  an  input  string  X,  a  problem  solver  attempts  to  find  the  most  credible  (or  “best”) 
explanation  for  X  from  the  set  /  of  all  possible  interpretations  (i.e.,  an  interpretation 
problem  is  a  discrete  optimization  problem).  Interpretation  is  a  form  of  constructive 
problem  solving  based  on  abductive  inferencing.  Interpretation  problems  are  similar 
to  classification  problems  and  to  a  more  distant  form  of  problem  solving,  parsing. 
In  interpretation  problems,  the  set  I  is  constructed  dynamically.  This  is  in  contrast 
to  classification  problems  where  /  is  preenumerated  and  the  problem  solving  task 
involves  only  the  identification  of  the  best  element  of  /  [2]. 

For  many  domains,  a  distinct  I  can  be  specified  in  a  naturally  structured  way 
and  it  is  this  structure  that  is  exploited  by  control  architectures  to  reduce  the  number 
of  elements  of  I  that  must  be  generated  or  to  otherwise  increase  the  efficiency  with 
which  /  is  specified.  Of  particular  interest  to  us  are  situations  where  I  is  finite  and 
can  be  defined  as  the  language  generated  by  a  grammar,  G,  and  where  the  evaluation 
function,  /,  used  during  problem  solving  is  recursively  defined  for  interpretation 
trees  z  in  /.  In  these  situations,  interpretations  take  the  form  of  derivation  trees  of  X 
and  the  constructive  search  operators  used  in  interpretation  problems  are  viewed  as 
production  rules  of  G.  G,  therefore,  defines  how  interpretations  are  decomposed.  For 
example,  the  production  rule p  nin2n3  might  correspond  to  the  interpretation  “p  is 
composed  of  an  n\,  an  n2,  and  an  n^.”  Given  an  ni,  an  n2,  or  an  n^,  a  problem  solver 
may  invoke  the  search  operator  Op  to  try  and  generate  a  p.  Each  syntactic  rule  of 
the  grammar  is  associated  with  a  corresponding  semantic  process  which  determines 
the  actual  “meaning”  of  an  interpretation.  This  representation  is  similar  to  that  used 
in  the  Composite  Decision  Process  (CDP)  model  of  Kanal  and  Kumar  [6]  and  the 
semantic  grammars  used  by  Fu  [4]. 

Based  on  this  definition  of  interpretation,  we  define  the  Interpretation  Decision 
Problem  (IDP)  formalism.  The  formalism  comprises  two  components,  one  for  mod¬ 
eling  the  domain  phenomena  that  generate  a  specific  problem  instance  and  one  for 
modeling  the  structure  of  an  interpretation  problem  solver.  The  generational  com¬ 
ponent  of  the  IDP  formalism  will  be  referred  to  as  IDP(j  and  the  interpretation 
component  will  be  referred  to  as  IDP/  .  The  primary  focus  of  this  paper  will  be  on 
IDPg  and  the  details  of  IDP/  are  left  to  [7].  The  IDPg  formalism  models  interpre- 
tation  domain  theories^  in  terms  of  four  feature  structures:  component  (or  syntax), 

‘intuitively,  interpretation  problems  are  tasks  where  a  stream  of  input  data  is  analyzed  and  an 
explanation  is  postulated  as  to  what  domain  events  occurred  to  generate  the  signal  data  -  the  problem 
solver  is  attempting  to  interpret  the  signal  data  and  determine  what  caused  it  [3]. 

A  domain  theory  is  the  computational  theory  that  is  the  basis  for  a  problem  solver’s  functionality. 
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utility  (or  credibility^ ,  probability  (or  distribution),  and  cost.  The  different  feature 
structures  are  represented  in  terms  of  a  domain  grammar  and  functions  associated 

with  production  rules  of  the  grammar.  The  formal  definition  of  an  IDP  grammar 
follows. 


Definition  1.1  An  IDP  Grammar  is  a  grammar,  Gi  -<  V,  N,SNT,  S,  P  >,  where 
V  is  the  set  of  terminal  symbols,  N  is  the  set  of  nonterminal  symbols,  SNT  is  the 
set  of  solution-nonterminal'^  symbols  that  correspond  to  final  states  in  the  associated 
interpretation  search  space,  S  is  the  start  symbol  for  the  grammar,  and  P  is  the  set  of 
context-free  production  rules. 

The  component  structure  is  modeled  directly  by  the  rules,  P,  of  G.  The  component 
subproblems  of  A  are  defined  by  the  right-hand-side  (RHS)  of  a  rule  of  the  grammar, 
p  .  A  -^(component  subproblems).  The  full  structure  is  specified  recursively  and 
subproblems  specified  by  the  nonterminals  and  terminals  of  a  grammar  correspond 
to  states  in  the  associated  search  space. 

The  credibility  structure  is  defined  recursively  in  terms  of  credibility  functions 
associated  with  G’s  production  rules.  Credibility  functions  include  consideration 
of  the  credibilities  of  the  component  elements  and  semantics.  Given  p.  A  — >  CD, 
the  credibility  of  “A,”  /a,  is  a  function  of  the  credibilities  of  “C”  and  “D”  and  the 
semantic  function,  T ,  that  measures  the  degree  to  which  “C”  and  “D”  are  semantically 
consistent.  Semantics  associated  with  each  production  rule  determine  the  actual 
domain  interpretation.  The  semantic  functions  will  typically  make  use  of  grammar 
e  ement  attributes  that  are  not  used  by  syntactic  functions.  These  attributes  are 
represented  using  the  feature  list  convention  described  by  Gazdar,  et  al.  [5],  For 
example,  a  semantic  process  in  an  acoustic  vehicle  tracking  system  might  combine 
two  partial  vehicle  tracks  into  a  longer  track  using  a  complex  process  that  verifies  the 
consistency  of  the  frequency  characteristics  of  the  partial  tracks  and  the  properties  of 
the  resulting  track  such  as  acceleration  and  velocity  constraints  based  on  time/location 
attributes  of  the  tracks.  Credibility  functions  arejepresented  /A(/c,/D,r  (C  D)) 
where  the  subscript  of  T,  p,  is  the  number  of  the  corresponding  production  rule  from 
the  grammar.  In  a  natural  language  interpretation  system,  a  semantic  function  might 

determine  the  degree  to  which  the  combination  of  a  noun  phrase  and  a  verb  phrase  is 
meaningful. 


Similarly,  computational  cost  structures  are  recursive  functions  of  component 
elements  and  the  cost  of  applying  semantic  operators.  For  example,  given  p.  A  -> 
CD,  the  cost  of  generating  “A,”  pa,  is  a  function  of  the  cost  of  generating  “C”  and 
U  and  tl^  cost  of  applying  the  semantic  function  rp(C,D).  This  is  represented 
^a(pc,Pd,  (rp(C,D))),  where  C{rp{i,j, ...))  is  the  cost  of  applying  the  semantic 

functinn  Fn 


In  [7],  crediWity  structures  are  formally  linked  to  the  semantics  associated  with  full  and  partial 
m  erpretations.  Thus,  a  full  or  partial  interpretation  that  has  a  high  credibility  can  intuitively  be 

hara^low  credWrf  ^"^^ntic  interpretation  and  a  full  or  partial  interpretation  that 

interpretltior  '"fu't'vely  be  thought  of  as  having  an  inconsistent  or  incomplete  semantic 

th^  SNT  ^  formalism,  a  potential  solution  is  referred  to  as  an  SNT.  The  uses  and  advantages  of 
the  SNT  representation  are  discussed  in  [7].  ^ 
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The  probability  structure  is  defined  in  terms  of  the  function,  ijj.  For  each  p  G 
P,  represents  the  distribution  of  RHSs  associated  with  p.  This  can  be  used 
both  to  generate  problem  instances,  in  which  case  V’  models  the  likelihood  of  certain 
events  happening  in  the  environment,  and  to  specify  the  actions  available  to  a  problem 
solver,  in  which  case  ^  models  the  problem  solver’s  perception  of  and  assumptions 
about  the  domain.  For  production  p  which  decomposes  to  RHS|,  RHS2 . . .  RHS^, 
V>(p)  models  the  frequency  distribution  with  which  these  rules  are  used  to  generate 
problem  instances.  The  frequency  with  which  the  rules  are  used  in  interpretation 
is  a  function  of  this  distribution.  Note  that  Yin  0(p-^)  =  1.  supports  models  of 
real-world  phenomena  such  as  uncertainty  caused  by  noise,  missing  data,  distortion 
and  masking,  as  will  be  discussed  in  Section  2. 

To  our  knowledge,  this  formalism  is  unique  and  it  offers  significant  contributions 
to  the  study  of  interpretation  problems.  The  most  important  is  that  the  IDP  formalism 
supports  the  analysis  of  the  assumptions  (implicit  and  explicit)  that  a  problem  solver 
makes  about  a  domain.  For  example,  using  the  IDP  formalism,  it  is  possible  to 
analyze  a  problem  solver’s  a  priori  expectations  about  a  domain.  By  analyzing  the 
statistical  properties  of  the  interpretation  trees  of  a  domain  grammar,  it  is  possible  to 
determine  characteristics  that  can  be  used  during  problem  solving,  such  as  a  priori 
expected  distributions  of  domain  events  that  can  be  used  in  model-based  processing. 
Furthermore,  this  analysis  can  also  identify  patterns  and  structures  that  can  be  used 
to  construct  effective  problem  solving  strategies  [7].  In  addition,  the  IDP  formalism 
supports  comparative  experiments  and  analyses  involving  different  problem  solvers 
applied  in  the  same  domain  and  the  same  problem  solver  applied  to  different  domains. 
Consequently,  this  work  represents  an  initial  step  needed  to  formalize  complex  search 
processes,  such  as  those  associated  with  blackboard  systems  [1]  that  are  used  in 
sophisticated  interpretation  systems. 

The  next  section  demonstrates  how  the  IDP^  formalism  can  represent  domain 
phenomena  such  as  noise  and  missing  data.  Section  3  describes  quantitative  results 
that  can  be  derived  using  the  formalism  and  presents  examples  of  how  these  can  be 
used  to  define  general  domain  structures. 

2  Representing  Problem  Domains 

The  quality  and  effectiveness  of  analysis  tools  derived  from  a  formal  model  of  a 
problem  domain  are  dependent  on  the  degree  to  which  the  formalism  adequately  rep¬ 
resents  the  phenomena,  and  their  causes,  characteristic  of  a  domain.  For  interpretation 
domains,  a  modeling  formalism  must  deal  effectively  with  interacting  phenomena, 
noise,  missing  data,  distortion,  and  masking  [3].  In  this  section,  we  will  discuss  how 
the  IDPg  formalism  addresses  these  issues. 

The  interpretation  domain  which  we  will  study  in  detail  is  a  vehicle  tracking 
problem  based  on  acoustic  sensor  data  as  defined  in  our  previous  work  on  RESUN 
[2].  The  problem  solver’s  input  consists  of  preprocessed  sensor  data  gathered  from  a 
single  contiguous  region.  The  problem  solver  then  processes  the  data  in  an  attempt 
to  identify  the  type  of  vehicle  that  generated  the  signals  and  the  path  it  traversed 
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through  the  region.  For  the  vehicle  tracking  domain,  a  modeling  formalism  must  also 
represent  multiple  events  (i.e.,  multiple  vehicle  tracks)  and  patterned  events  (vehicles 
moving  in  a  coordinated  manner). 

In  order  to  model  the  genertion  of  a  problem  instance,  the  IDPg  simulator  begins 
by  creating  a  feature  list  for  the  start  symbol,  S.  This  list  includes  credibility,  vehicle 
heading,  velocity,  type,  etc.  Next,  the  generator  probabilistically  chooses  a  production 
rule,  p,  associated  with  the  start  symbol  and  applies  p  to  the  start  symbol,  i.e.,  the 
generator  randomly  chooses  a  production  rule  with  the  start  symbol  on  the  left-hand- 
side  and  replaces  S  with  the  right-hand-side  of  the  rule.  The  choice  of  which  rule 
to  apply  is  made  based  on  the  weights  of  the  ^  associated  with  the  different  rules. 
As  the  generator  replaces  S,  it  assigns  each  of  the  replacing  symbols  a  feature  list. 
Each  of  the  feature  lists  is  determined  by  applying  p’s  feature  list  functions  to  S’s 
original  feature  list.  The  new  symbols,  each  with  their  own,  unique  feature  list,  are 
added  to  either  a  queue,  if  the  symbol  is  a  nonterminal,  or  an  output  set,  if  the  symbol 
is  a  terminal,  and  the  whole  process  is  repeated.  The  first  element  of  the  queue  is 
removed  and  processed,  and  the  resulting  new  symbols  are  added  to  the  tail  of  the 
queue  or  to  the  output  set.  This  continues  until  the  queue  is  empty  and  the  output  set 
contains  terminals  with  fully  specified  feature  lists. 

The  basic  structure  of  the  vehicle  tracking  problem  domain  consists  of  scenarios. 
Each  scenario  consists  of  a  number  of  tracks.  Each  track  can  represent  a  single 
vehicle  traversing  the  region  (an  1-Track),  a  vehicle  and  a  corresponding  ghost  track 
(a  G-Track),  or  a  group  of  vehicles  moving  in  a  coordinated  manner  (a  P-Track).  Each 
of  these  tracks  is  composed  of  vehicle  locations,  Vn,  where  n  is  a  vehicle  identifier, 
(e.g.,  VI  is  a  vehicle  1  location  at  time  t.)  Vehicle  locations  are  composed  of  group 
classes  and  group  classes  consist  of  signal  classes.  The  signal  classes  can  be  thought 
of  as  preprocessed  signal  data. 

A  grammar  that  will  generate  problem  instances  for  this  domain  is  shown  in 
Fig.  1.  These  figures  show  the  production  rules  of  the  grammar  and  the  probability 
distribution  values  for  xJj  associated  with  each  of  the  rules.  The  feature  lists  are  shown 
as  bracketed  subscripts.  In  this  example,  “f”  represents  the  characteristics  time,  x 
and  y  coordinates,  velocity  (v),  acceleration  (a),  offset,  and  energy.  Velocity  is  used 
to  generate  the  x  and  y  coordinates  of  the  next  location  in  the  track.  Acceleration  is 
used  to  adjust  velocity  over  time.  Velocity  and  acceleration  are  also  used  to  constrain 
the  progression  of  a  track.  For  example,  each  particular  kind  of  vehicle  track  has 
a  maximum  velocity  and  acceleration  that  cannot  be  exceeded.  Offset  represents 
a  displacement  from  a  “base”  position.  This  is  used  to  generate  pattern  tracks  of 
multiple  vehicles  and  ghost  tracks  [7].  Figure  1  explicitly  shows  how  feature  lists 
are  modified  for  the  different  grammar  rules.  For  example,  rule  10  indicates  that  the 
generation  of  two  track  locations  from  a  pattern  track  location  involves  adding  the 
offset  to  the  x  and  y  locations  of  track  location  Tl.  Similarly,  the  generation  of  a  T1 
track  location  from  an  I-Trackl  in  rule  4  also  involves  generating  a  “new”  I-Trackl 
for  the  next  time  quantum  (t+1)  with  new  x  and  y  positions  (x+V+A,  y+V-t-A).  Many 
rules  include  a  RHS  of  A.  This  indicates  that  there  is  a  possibilty  that  the  production 
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Figure  2:  Vehicle  Tracking  Scenario  Examples 
rule  will  not  lead  to  the  production  of  signal  data. 

Rules  1  through  9  are  used  to  generate  the  high-level  track  phenomena.  The 
recursive  rule  number  2  will  generate  a  number  of  track  phenomena  for  each  scenario. 
By  adjusting  the  probabilities  for  the  two  alternative  RHSs,  this  rule  can  be  tailored 
to  generate  any  number  of  tracks.  If  the  probability  of  the  first  RHS  is  close  to  1, 
this  rule  will  generate  many  tracks.  If  it  is  close  to  0,  this  rule  will  generate  only  one 
track.  Rules  4  through  9  generate  the  time/vehicledocations  for  each  track^. 

Simple  scenario  examples  are  shown  in  Fig.  2.  In  all  these  examples,  the  tracks 
shown  move  from  left  to  right.  This  is  a  convention  used  for  presentation  clarity 
only.  The  actual  grammar  can  generate  tracks  that  originate  anywhere  on  the  sensed 
region’s  perimeter  and  at  any  point  in  time.  Thus,  if  an  experimental  run  is  to  simulate 
a  time  span  of  several  days,  the  vehicle  tracks  that  are  generated  can  begin  at  any 
point  in  time.  A  single  track  is  shown  in  Fig.  2.a.  Figure  2.b  is  an  example  of  multiple 
tracks  in  a  single  scenario.  A  pattern  track  is  shown  in  Fig.  2.c.  This  is  contrasted 
with  the  ghost  track  in  Fig.  2.d.  Notice  that  the  ghost  track  differs  from  the  pattern 
track  in  that  the  data  is  more  “spotty.”  There  is  more  missing  data  and  more  time 
instances  for  which  data  is  altogether  lacking. 

2.1  Interacting  Phenomena 

The  feature  list  convention  is  used  to  generate  pattern  and  ghost  tracks.  This  is  shown 
in  rules  6  through  9  and  rules  10  through  13.  These  rules  lead  to  domain  events 
^Note  that  these  rules  are  not  grounded.  They  continue  to  generate  data  until  some  predetermined 
time  limit  is  reached  or  until  the  data  generated  falls  outside  of  the  region  sensed  by  the  problem  solver. 
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consisting  of  two  tracks  with  closely  related  properties.  As  shown  in  Figures  2.c 
and  d,  ghost  tracks  and  pattern  tracks  move  in  coordination  with  each  other.  Ghost 
tracks  do  this  because  one  signal  is  a  reflection  of  a  true  signal  and  pattern  tracks 
do  this  because  they  are  composed  of  multiple  vehicles  moving  according  to  a  plan. 
For  example,  a  pattern  track  might  include  a  tanker  vehicle  and  a  second  vehicle 
following  it  while  refueling,  multiple  vehicles  moving  in  an  attack  pattern,  etc. 

Rules  6  through  9  generate  the  data  for  each  time-frame  of  the  tracks.  Rules 
10  through  13  generate  the  patterned  data.  The  vehicle  patterns  are  generated  using 
feature  lists  and  offsets.  Each  element  in  a  track,  which  will  be  referred  to  as  a  vehicle 
location,  has  an  x  and  a  y  coordinate.  As  seen  in  rule  6,  each  element  of  a  track  has  a 
basic  X  and  y  coordinate,  but  one  element  has  a  position  modified  by  a  variable  ojfset. 
The  offset  can  be  determined  by  an  arbitrarily  complex  function,  resulting  in  a  very 
expressive  technique  for  generating  coordinated  domain  events. 

This  is  just  one  example  of  how  the  feature  list  convention  can  be  used  to  generate 
domain  phenomena  that  are  coordinated  or  related  in  some  way  and  that  appear  to 
interact  through  time  and/or  distance.  This  is  a  very  important  point  because  it 
suggests  that,  in  some  situations,  it  is  possible  to  use  a  context-free  grammar  to 
generate  problem  instances  that  might  be  considered  context-sensitive.  The  power 
of  the  analysis  tools  presented  here  is  derived  in  large  part  from  their  context-free 
nature. 

2.2  Noise 

In  any  interpretation  domain,  noise  plays  a  significant  role  in  increasing  the  complex¬ 
ity  of  problem  solving.  This  is  the  result  of  noise  increasing  the  number  of  plausible 
interpretations  (i.e.,  increasing  the  ambiguity)  that  have  to  be  differentiated.  We  have 
developed  techniques  for  modeling  a  variety  of  different  types  of  noise  and  a  formal 
definition. 

Definition  2.1  Noise  -  A  grammar,  G,  is  subject  to  noise  iff 

3  a  rule  p  €  P,  where  p:  A  uBv  p’:  A  niun2Bn3Vn4,  for  u.,v  G  (V  U  N)*,  A 

G  N,  B  G  (V  U  N),  n,-  G  (V  U  N)*,  and  WuJbJMu,B,v))  > 

fa  5  fn^  ?  fv^  fn^  1  ^ p' ti,  jfn25  for  maximum  ratings  of  the  fm-  The 

distribution  of  p  and  p'  is  modeled  by  ip. 

Production  rules  18  and  35,  shown  in  Fig.  1,  are  used  to  generate  random  noise. 
This  represents  random  phenomena  associated  with  the  natural  state  of  the  domain 
when  no  vehicles  are  passing  through  the  region.  There  could  be  many  causes  of 
random  noise  including  temperature  changes,  equipment  malfunctions,  natural  flora 
or  fauna,  and  more.  The  amount  of  random  noise  in  a  domain  is  determined  by  the 
probabilities  associated  with  rule  18.  If  the  probability  of  the  first  RHS  is  high,  the 
domain  will  include  a  great  deal  of  noise.  If  the  probability  is  low,  very  little  noise 
will  be  generated.  The  properties  of  the  random  noise  can  be  further  modified  by 
adjusting  the  probabilities  associated  with  the  RHSs  of  rule  35. 

In  addition  to  adjustments  to  the  probability  distributions,  random  noise  charac¬ 
teristics  can  be  modified  with  the  feature  list  convention.  For  example,  note  that  the 


feature  lists  used  in  rule  18  do  not  include  any  information  about  x  and  y  locations. 
In  our  simulator,  we  determine  the  x  and  y  locations  of  random  noise  using  a  uniform 
distribution  function.  This  does  not  have  to  be  the  case.  It  would  be  easy  to  model 
a  domain  in  which  random  noise  tended  to  appear  more  in  certain  locations.  Other 
properties  of  random  noise,  such  as  its  energy  level,  can  be  manipulated  in  a  similar 
way. 

In  addition  to  random  noise,  there  is  often  noise  that  is  more  closely  associated 
with  the  domain  events  that  the  system  is  trying  to  interpret.  For  example,  in  a  vehicle 
tracking  domain,  it  is  reasonable  to  expect  there  to  be  phenomena  corresponding  to  a 
vehicle  interacting  with  the  domain  in  an  unexpected  way.  For  example,  a  train  may 
hit  a  rock  or  some  other  object  on  the  track,  something  may  fall  off  a  ship,  or  a  jet 
engine  may  emit  some  uncharacteristic  noise.  These  sorts  of  phenomena  are  different 
from  random  noise,  since  they  will  only  occur  in  the  presence  of  a  vehicle,  but  they 
are  still  a  form  of  noise  since  they  are  not  characteristic  of  a  vehicle  and  may  lead  to 
ambiguity  or  otherwise  obscure  the  sensing  of  phenomena  that  is  characteristic  of  a 
vehicle.  Rule  23  shows  an  example  of  how  this  sort  of  noise  can  be  represented  by 
adding  elements  to  the  RHS  of  an  existing  rule.  Specifically,  in  rule  23, 

Gl[y]  —*■  S2[y^]  S3[/]  S4[^],  the  extra  signal,  S4,  could  be  considered  noise. 

More  comprehensive  noise  structures  can  be  represented  at  other  levels  of  the 
grammar  in  a  similar  way.  For  example,  ghosting  phenomena  is  really  a  complex 
of  noise  at  the  track  level.  Noise  phenomena  can  be  easily  added  at  the 
group  level  in  a  manner  similar  to  that  used  to  add  signal  data  noise. 

Another  kind  of  noise  is  represented  not  by  additional  phenomena,  but  by  slightly 
altered  phenomena.  This  is  sometimes  referred  to  as  sensor  shifting.  Intuitively, 
sensor  shifting  phenomena  can  be  caused  by  errors  in  sensors  or  variations  in  a 
physical  domain  such  as  air  or  water  temperature.  All  of  the  group  level  production 
rules  include  examples  of  sensor  shifting.  For  example,  the  second  RHS  of  rule  23 
shows  a  shift  from  S2  to  S3.  Similar  shifting  cari  occur  at  all  levels  of  the  grammar. 

2.3  Correlated  and  Uncorrelated  Noise 

In  [7],  we  identified  two  kinds  of  noise,  correlated  and  uncorrelated.  Correlated 
noise  leads  to  the  generation  of  additional  high-level  interpretations.  This  ambiguity 
causes  the  problem  solver  to  perform  additional  work  to  differentiate  the  possible 
interpretations.  Uncorrelated  noise  may  lead  to  additional  work,  but  it  does  not 
lead  to  the  generation  of  additional  interpretations.  Often,  uncorrelated  noise  can 
be  identified  and  eliminated  from  further  processing  with  low-cost  pruning  operators 

m. 

The  complexity  of  the  example  grammar  is  such  that  it  is  difficult  to  identify  all 
correlated  and  uncorrelated  noise  by  inspection  alone.  However,  the  rules  associated 
with  the  non-terminal  n  include  examples  of  both.  For  example,  the  first  LHS 
of  production  rule  35:  (see  Fig.  1)  >  Sly]  is  an  example  of  correlated  noise 

because  the  signal  S 1  can  be  used  to  build  an  interpretation  of  an  I-Trackl  in  situations 
where  it  would  not  otherwise  be  built.  In  contrast,  the  last  LHS  of  production  rule 
35;  ny^t]  — >  S20y]  is  an  example  of  uncorrelated  noise  because  the  signal  data  S20  is 
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not  used  to  generate  any  vehicle  tracks  and  can  be  ignored  in  subsequent  processing. 

A  more  meaningful  example  of  correlated  noise  is  shown  in  the  problem  instance 
illustrated  in  Fig.  2.e.  In  this  example,  production  rule  18  from  Fig.  1:  Ny^t]  «[/,«] 
will  create  random  noise  throughout  the  sensed  region.  Should  some  of  this 
noise  occur  near  a  track,  it  will  cause  the  problem  solver  to  create  an  interpretation 
that  includes  the  noise  as  a  possible  element  of  the  track.  This  interpretation  could 
be  in  conflict  with  the  interpretation  that  corresponds  most  closely  with  the  event  that 
created  the  noise. 

2.4  Missing  Data 

Missing  data  is  easy  to  represent  and  understand  using  IDP  grammars.  By  simply 
dropping  an  element  from  an  RHS,  it  is  possible  to  model  phenomena  that  results  in 
data  not  being  sensed.  Missing  data  can  be  caused  by  sensor  errors,  environmental 
conditions,  processing  errors,  and  more.  Formally, 

Definition  2.2  Missing  Data  -  A  grammar,  G,  is  subject  to  missing  data  iff 
3  p  €  P,  where  p:  A  uBv  =^>  p':  A  ^  uv,  for  u,v  e  (V  U  N)*,  A  €  N,  B  €  (V  U  N),and 

fpifui  /Bi  B,v))  >  /p'(/u  » fvi  Tp'  (u,  u)).  In  addition,  the  distribution  of  p 

and  p'  is  modeled  by 

The  second  and  third  RHSs  to  rule  19  show  examples  of  how  missing  group  level 
data  can  be  modeled  by  omitting  the  appropriate  non-terminal.  The  sixth  and  seventh 
RHSs  of  rule  23  show  how  missing  signal  data  can  be  modeled  in  a  similar  way. 

The  production  rules  associated  with  ghost  tracks  also  demonstrate  how  missing 
data  phenomena  can  be  related  across  production  rules.  Note  that  the  RHSs  for  the 
ghost  group  level  nonterminals  are  identical  with  the  RHSs  for  the  regular  group 
level  nonterminals.  The  only  difference  is  in  the  distribution  probabilities.  For  ghost 
data,  the  probability  of  missing  signal  data,  or  completely  missing  group  data,  is 
much  higher.  This  is  one  of  the  ways  in  which  ghost  tracks  can  be  identified.  In  this 
example,  the  related  domain  events,  i.e.,  corresponding  levels  of  missing  data,  are 
linked  not  by  the  feature  list  convention,  but  by  a  common  non-terminal. 

3  Characterizing  Domain  Structures 

Given  a  formal  IDPq  model,  it  is  possible  to  analytically  characterize  properties 
of  the  domain  for  use  in  explanation,  prediction,  and  design  of  problem  solver 
performance.  In  [7]  we  present  analytical  techniques  for  calculating  a  variety  of 
different  domain  properties  including  expected  cost  of  problem  solving,  expected 
frequencies  of  domain  events,  expected  costs  of  individual  interpretation  search  paths, 
the  expected  cost  associated  with  incorrect  search  paths,  ambiguity  relationships, 
a  quantified  potential  value  for  meta-operators,  and  more.  Two  specific  domain 
structuies  are  the  concept  of  marker  and  differentiator.  In  the  remainder  of  this 
section,  we  demonstrate  the  IDP^  formalism  by  defining  these  two  domain  structures 
and  we  discuss  how  they  can  be  used  in  dynamic  control  strategies  and  in  the  design 
of  problem  solver  architectures. 
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The  definitions  of  markers  and  differentiators  are  relative  to  the  concept  of  a 
solution  nonterminal,  or  SNT,  described  in  Definition  1.1.  For  a  given  SNT,  an 
intermediate  result  (i.e.,  a  terminal  or  a  nonterminal  different  from  the  SNT)  is  a  strong 
marker  if  it  is  always  (or,  from  a  statistical  perspective,  almost  always)  implied  by  the 
SNT.  Information  about  markers  can  be  used  to  predict  the  occurance  and  nature  of 
low-level  events  and,  thus,  can  be  used  to  support  top-down,  model-based  processing. 
Furthermore,  analysis  of  the  component  structure  can  be  used  to  determine  the  degree 
to  which  an  intermediate  result  differentiates  an  SNT.  Differentiation  refers  to  the 
extent  to  which  an  intermediate  result  is  exclusively  associated  with  an  SNT.  The 
differentiation  relationship  is  the  inverse  of  the  marker  relationship  and  it  can  be  used 
to  support  bottom-up,  island-driving  processing. 

To  analytically  determine  marker  and  differentiator  relationships  for  a  given  SNT, 
A,  and  a  partial  result,  b,  it  is  necessary  to  define  several  relationships  between  search 
state  A  and  search  state  b.  To  accomplish  this,  it  is  necessary  to  think  of  a  given 
IDPg  grammar  as  a  definition  of  a  search  space  in  which  the  states  belong  to  classes 
corresponding  to  terminals  and  nonterminals  of  the  grammar  and  final  states  belong  to 
the  class  of  SNTs.  For  example,  in  the  sample  grammar,  the  nonterminal  V2  defines 
a  class  of  states  in  the  search  space.  During  problem  solving,  there  could  be  many 
instances  of  the  class,  each  associated  with  a  different  vehicle  2  location.  Given  the 
correspondance  between  elements  of  an  IDPg  grammar  and  states  in  a  search  space, 
it  is  possible  to  compute  relationships  between  states  by  computationally  determining 
relationships  between  elements  of  the  grammar. 

The  following  definitions  are  needed  to  formally  define  the  concepts  of  marker 
and  differentiator. 

Definitions.!  D{A)  =  P{S  H*  A),  where  A  6  {SNT}.  D{A)  defines  domain 
specific  frequency  distribution  functions  for  the  set  of  SNTs,  i.e.,  D{A)  =  probability 
of  the  domain  event  corresponding  to  interpretation  A  occurring.  In  general,  these 
distributions  will  be  represented  with  production  rules  of  the  grammar  associated  with 
the  start  symbol.  The  RHSs  of  these  rules  will  be  from  the  grammar’s  set  of  SNTs 
(i.e.,  A  €  {SNT}).  Uncertainty  regarding  this  distribution  leads  to  problem  solving 
uncertainty. 

For  example,  the  SNTs  for  the  vehicle  tracking  grammar  are  I-Trackl,  I-Track2, 
G-Trackl,  G-Track2,  P-Trackl,  and  P-Track2.  From  production  rules  2  and  3  (see 
Fig.  1),  the  domain  specific  distribution  function  for  the  SNTs  can  be  computed  from 
the  frequency  of  the  nonterminal  Track  multiplied  by  the  values  for  ip  corresponding 
to  each  of  the  SNTs.  For  example,  D{I  —  TrackX)  =  1.11*0.25  =  0.275.  Similarly, 
D{1  -  Track!)  =  0.275,  D{G  -  Trackl)  =  0.11,  D{G  -  Track!)  =  0.11, 
D{P-  Track  1 )  =  0. 1 65,  and  D{P  -  Track!)  =  0. 1 65. 

If  the  values  for  ip  in  production  rule  2  were  to  change,  these  values  would  be 
different.  For  example,  if  the  ip  values  for  the  RHSs  of  rule  2  were  both  0.5,  the 
distributions  would  be  D{I  -  Trackl)  -  !*  0.25  =  0.5,  D{I  -  Track!)  =  0.5, 
D{G  -  Trackl)  =  0.2,  D{G  -  Track!)  =  0.2,  D{P  -  Trackl)  =  0.3,  and 
D{P  —  Track!)  =  0.3. 
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Definition  3.2  {RHS{A)}  =  the  set  of  elements  that  appear  on  right-hand-sides  of 
production  rules  with  A  on  the  left-hand-side,  Ae  N  U  SNT. 

From  production  rule  20,  {RHS(V2)}  =  {G3,G8,G12}.  A  more  complex  rule 
is  26,  {RHS{GS,)}  =  {513, 514, 515, 517, 518}. 

Definition  3.3  P{b  G  {i?i75(A)}'-')  =  €  RHS,{A))  +  ZvAP{r'  € 

{RHS{A)})  *  P{b  G  {RHS{r')A^)),  where  {RHS{A)}  is  the  set  of  all  RHSs  of 
A,  RHSi{A)  is  the  potential  RHS  of  production  rule  of  A,  P{b  G  RHSi{A))  — 
■ip{RH Si{A))  if  6  G  RHSi{A),  0  otherwise,  each  element  r'  is  a  nonterminal  that 
appears  in  a  RHS  of  A  that  does  not  also  include  6,  6  G  F  U  U  SNT,  and 
A  G  U  SNT.  The  probability  of  partial  interpretation  b  being  included  in  any 
RHS  of  A,  as  defined  by  the  distribution  function  V’(A).  The  notation  indicates 
that  the  definition  of  RHS  is  recursive,  i.e.,  {i?775(A)}'‘^  represents  the  transitive 
closure  of  all  states  that  can  be  generated  from  A.  Thus,  b  can  be  in  an  RHS  of  A,  or 
in  the  RHS  of  some  element  of  an  RHS  of  A,  etc. 

From  production  rules  20,  24,  26,  and  27, 

{R7rS(F2)'-''}  =  {G3,  G8,  G12, 54,  S5, 56, 57, 58, 513, 514, 515, 517, 518}  (1) 

F(54  G  {RHS{V2)}^^)  =  (2) 

P(G3  G  {RHS{V2)})  *  P{S4  G  {PP5(G3)}''^)  =  0.65  *  0.1  =  0.065  (3) 

Definition 3.4  P{A  h*  b)  =  D{A)  *  P{b  G  {RHS{A)Y),A  G  SNT,b  e  VU 
N.  Probability  that  the  partial  interpretation,  b,  is  generated  from  full  or  partial 
interpretation  A,  where  b  is  a  descendant  of  A. 

Definition  3.5  Ambiguity  -  Given  a  domain  event.  A,  its  interpretation  is  ambiguous 
with  the  interpretation  of  a  second  domain  event,  B,  when  B  subsumes  A  (the  subsume 
relationship  is  specified  in  [7]).  i.e.,  A  is  ambiguous  with  B  when  the  low-level  signal 
data  generated  by  B  can  be  mistaken  for  an  A.  Note  that  this  definition  of  ambiguity 
is  not  reflexive.  Thus,  A  being  ambiguous  with  B  does  not  imply  that  B  is  ambiguous 
with  A. 

Definition  3.6  P{A  Gb)  =  P{A  F*  b)  -f  Evb  P{B  h*  6),  A,B  e  SNT,  b^VuN, 
and  where  the  interpretation  of  A  is  ambiguous  with  the  interpretation  of  each  B. 
Intersection  of  domain  events  A  and  b,  where  6  is  a  descendant  of  A.  The  intersection 
of  A  and  b  will  occur  when  both  A  and  b  are  generated  during  the  course  of  a  specific 
problem  solving  instance.  This  will  occur  when  A  leads  to  the  generation  of  b  and 
when  the  occurrence  of  a  distinct  event,  B,  leads  to  the  generation  of  b  and  when 
A  is  ambiguous  with  B.  In  the  case  where  B  leads  to  the  generation  of  b,  b  and  A 
still  intersect  because  an  A  will  be  generated  during  processing  since  A  is  ambiguous 
with  B. 

Definition  3.7  P{a  n  b)  =  P{a)  *  P(b  G  {RHS{a)Y^),a  G  A^,6  G  F  U  N. 
Intersection  of  domain  events  a  and  b,  where  6  is  a  descendant  of  a  and  where  a  is  a 
nonterminal  that  is  not  an  SNT. 
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Definition  3.8  P{b)  =  Z\/a  D{A)  *  P{bf]  A  \  A),A  e  SNT,  beVuNU  SNT. 
The  probability  of  partial  interpretation  b  being  generated.  The  notation  P{b  r]A  \  A 
is  the  probability  that  b  and  A  intersect  in  situations  where  event  A  is  responsible  for 
the  generation  of  the  signal  data. 

Given  these  definitions,  it  is  now  possible  to  formally  define  the  concept  of  marker: 

Definition  3.9  MARKER(A,  b)  =  where  A  is  an  element  of  the  set  SNT  and 

b  is  any  terminal  or  nonterminal.  The  value  of  the  MARKER  function  indicates  the 
probability  that  the  generation  of  an  A  will  cause  the  generation  of  a  b.  A  value  of 
MARKER(A,  b)  that  is  close  to  1  indicates  a  relationship  where  every  occurance  of  A 
indicates  that  a  b  can  be  derived.  The  value  of  MARKER(A,  b)  indicates  the  strength 
of  the  relationship  between  A  and  b. 

Definition  3.10  MARKER(a,b)  —  where  a  is  a  nonterminal  and  b  is  any 

terminal  or  nonterminal. 

Knowledge  of  markers  can  be  used  to  design  a  problem  solving  architecture 
and  to  construct  focusing  mechanisms  in  a  dynamic  control  strategy.  For  example, 
assume  that  the  vehicle  tracking  system  is  attempting  to  extend  a  track  of  type  I- 
Trackl  through  a  large  amount  of  noise  spread  over  a  wide  region.  This  could  be 
computationally  expensive  if  the  problem  solver  has  to  examine  the  implications  of 
every  possible  point  of  noise  in  a  bottom-up  fashion.  The  number  of  potential  tracks, 
and  the  cost,  will  increase  combinatorially  with  the  amount  of  noise.  Information 
about  markers  can  be  used  to  design  control  strategies  to  reduce  search  costs  in  these 
situations.  In  the  sample  grammar,  the  strongest  marker  at  the  group  level  for  an 
individual  component  of  an  I-Trackl  (i.e.,  a  vehicle  location,  VI)  is  Gl.  Specifically, 

MARKER{V1,GI)  =  P{V\  D  Gl)/P(Vl)  =  (4) 

(P(V1)  *  P{G\  e  {RHS{V1)}^^))/P{VI)  =  P{Gl  e  =  0.95.  (5) 

This  information  could  be  used  to  design  an  expectation  driven  control  strategy  for 
extending  an  I-Trackl  (from  ti  to  tj)  by  predicting  the  characteristics  of  all  Gl’s  that 
can  be  used  to  extend  the  track  to  time  tj+i  and  then  only  following  the  implications 
of  the  Gl’s  that  match  the  predictions.  Also,  this  information  could  be  used  in 
differential  diagnosis  processing  to  disambiguate  competing  hypotheses  [2].  For 
example,  in  certain  situations  it  is  possible  to  use  marker  information  to  determine 
if  a  hypothesis  was  erroneously  derived  from  noise.  By  examining  a  hypothesis’ 
supporting  data,  a  problem  solver  can  determine  if  the  support  from  a  strong  marker 
is  consistent  with  expectations.  If  the  support  is  either  much  less  or  much  greater 
than  expected,  this  would  be  a  good  indication  that  the  hypothesis  is  not  correct. 

For  example,  assume  that,  for  hypothesis  X,  MARK ER{X,  y)  =  0.99.  If  the 
problem  solver  generates  an  X  for  which  there  is  no  y  supporting  data,  the  problem 
solver  can  conclude  that  the  probability  of  this  happening  is  1  -  0.99  and  that  the 
X  might  be  erroneous.  In  the  vehicle  tracking  domain,  this  might  occur  if  there  is  a 
signal  group,  s,  that  is  a  strong  marker  for  a  particular  kind  of  track,  t.  If  the  problem 
solver  can  derive  an  interpretation  of  a  ^  without  finding  any  s  signals,  it  is  likely  that 
the  hypothesis  is  incorrect. 
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In  contrast,  the  signal  data  S 15  is  a  poor  marker  for  the  group  level  event  G8. 

MARKER{GS,  515)  =  P(G8  fi  515)/P(G8)  =  (6) 

(P(G8)  *  P(515  €  {PP5(G8)}''‘^))/P(G8)  =  P(515  G  {PP5(G8)}'"')  =  0.1.  (7) 

A  formal  definition  of  the  differentiator  relationship  can  be  given  as; 

Definition  3.11  DIFF(A,  b)  =  where  A  e  SNT,  and  6  6  y  U  A^U  5iVr  is  any 
terminal  or  nonterminal.  A  value  of  DIFF(A,  6)  that  is  close  to  1  indicates  a  strong 
causal  relationship  between  A  and  b  to  the  exclusion  of  all  other  causes.  A  value  of 
DIFF(A,  b)  that  is  close  to  0  indicates  a  weak  causal  relationship  between  A  and  b. 

Knowledge  about  differentiators  can  be  used  both  in  the  design  of  problemsolving 
architectures  and  dynamic  control  algorithms.  Architecturally,  differentiators  can  be 
used  to  construct  special  operators  for  differential  diagnosis  [2].  In  control  algorithms, 
differentiators  can  be  used  to  focus  problem  solving  activity  [3]. 

In  the  vehicle  tracking  grammar,  SI,  S2,  and  S3  are  strong  differentiators  for  the 
event  VI  (a  vehicle  location  of  type  1).  Specifically, 

£»/FP(Vl,51)  =  P(Kin51)/P(5l)=  (8) 

{P{Vl)  *  P(51  e  {Pi?5(Fl)}'-"))/P(51)  =  0.55  *  0.70/0.39  =  0.99.  (9) 

(Note  that  SI  is  also  generated  by  production  rule  35,  and,  consequently,  P(51)is 
greater  than  the  probability  of  SI  being  derinved  solely  from  VI.)  Similarly, 
DIFF(V1,  S2)  =  0.55  *  0.70/0.39  =  0.99  and  DIFF(I/1, 5'3)  =  0.55  *  0.3/0.17  = 
0.97.  Given  a  large  amount  of  noise,  a  possible  control  strategy  would  be  to  de¬ 
termine  if  there  was  a  large  amount  of  SI,  S2,  and  S3  events  in  the  data.  If  there 
are,  there  is  a  strong  likelihood  that  they  were  generated  by  a  track  with  VI  as  a 
component.  This  information  could  be  used  to  filter  out  data  that  are  weak  markers 
for  VI.  For  example,  MARKER(V1,S8)  =  0.07.  Consequently,  ignoring  S8  data 
might  be  a  reasonable  strategy  in  this  situation. 

In  general,  the  marker  and  differentiator  relationships  can  be  used  to  explain  the 
success  of  techniques  such  as  approximate  processing  and  incremental  planning  [1]. 
From  a  bottom-up  perspective,  the  differentiator  relationship  can  identify  the  interme¬ 
diate  results  that  are  most  appropriate  for  abstracting  and  clustering.  From  a  top-down 
perspective,  the  marker  relationship  can  help  predict  the  expected  characteristics  of 
intermediate  results  derived  from  model-driven  processing  and  thus  focus  processing 
by  filtering  out  noise. 

Though  not  discussed  in  this  paper,  the  concepts  of  marker  and  differentiator  can 
be  further  refined  to  include  information  about  the  credibilities  of  the  hypotheses. 
For  example,  the  relationships  could  vary  significantly  for  low-credibility  events  and 
high-credibility  events.  Given  a  track  level  hypothesis  that  has  a  low-credibility, 
its  relationship  with  markers  could  be  quite  different  from  that  of  a  hypothesis  with 
similar  characteristics  and  a  high-credibility.  The  same  is  true  for  differentiator 
relationships.  A  low-level  hypothesis  with  a  high-credibility  may  be  a  much  better 
differentiator  than  a  similar  hypothesis  with  a  low-credibility. 
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4  Conclusion 


The  techniques  presented  in  this  paper  for  formally  modeling  interpretation  domains 
represent  a  necessary  step  toward  developing  a  basis  for  explaining,  predicting,  and 
designing  the  performance  of  sophisticated  knowledge-based  problem  solvers.  By 
explicitly  modeling  a  problem  domain,  the  IDPg  formalism  can  be  used  to  exper¬ 
imentally  test  the  effects  of  alternative  control  strategies.  For  example,  a  specific 
strategy  can  be  applied  in  different  domains  by  using  slightly  altered  versions  of  an 
IDPg  grammar.  This  could  be  done  to  identify  the  characteristics  of  domains  in 
which  the  control  strategy  is  most  effective.  Alternatively,  different  control  strategies 
can  be  compared  and  contrasted  statistically  by  using  an  IDPg  grammar  to  generate 
numerous  experimental  problem  instances. 

More  importantly,  by  explicitly  specifying  a  problem  domain,  IDPg  formalism 
makes  it  possible  to  analyze  its  characteristics  in  ways  that  can  be  used  to  design 
control  strategies.  For  example,  general  domain  structures  such  as  differentiators 
and  markers  can  be  identified  and  used  to  design  meta-operators.  These  structures 
represent  expectations  that  are  derived  from  statistical  abstractions  of  a  domain. 
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Abstract 

Coordination  is  an  essential  technique  in  cooperative,  distributed  multi¬ 
agent  systems.  However,  sophisticated  coordination  strategies  are  not  al¬ 
ways  cost-effective  in  all  problem-solving  situations.  This  paper  presents  a 
learning  method  to  acquire  coordination  plans  for  specific  problem-solving 
situations  so  that  the  appropriate  type  of  coordination  strategy  is  used. 
This  learning  is  accomplished  by  recording  and  analyzing  traces  of  in¬ 
ferences  after  problem  solving.  The  analysis  results  in  identification  of 
situations  where  inappropriate  coordination  strategies  have  caused  redun¬ 
dant  activities  or  the  lack  of  timely  execution  of  important  activities,  thus 
degrading  system  performance.  Based  on  this  identification,  situation- 
specific  coordination  plans  are  created  which  use  additional  non-local  in¬ 
formation  about  activities  in  the  networks  to  remedy  the  problem.  An 
example  from  a  real  distributed  problem-solving  application  involving  di¬ 
agnosis  of  a  local  area  network  is  described. 

CFP  Topics:  Learning  and  Adaptation  in  Multi-agent  Worlds; 

Coordination;  Organizational  Self-Design 
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1  Introduction 

Achieving  globally  coherent  activity  in  a  cooperative,  distributed  multi-agent 
system  is  a  difficult  problem  for  a  number  of  reasons.  One  difficulty  is  that  an 
agent’s  control  decisions,  based  only  on  its  local  view  of  problem-solving  task 
structures,  may  lead  to  inappropriate  decisions  about  which  activity  it  should  do 
next,  what  results  it  should  transmit  to  other  agents  and  what  results  it  should 
ask  other  agents  to  produce.  If  an  agent  has  a  view  of  the  task  structures  of  other 
agents  it  can  make  more  informed  choices  [6,  7,  11].  Another  difficulty  is  that  even 
with  the  availabihty  of  this  type  of  meta-level  information,  there  is  stiU  residual 
uncertainty  about  outcomes  of  tasks  and  what  future  tasks  will  be  coming  into 
the  system  which  may  result  in  agents  still  exhibiting  non-coherent  behavior. 
These  difficulties  with  achieving  effective  coordination  are  further  exacerbated 
by  the  fact  that  an  agent,  in  acquiring  and  exploiting  a  non-local  view  of  other 
agents’  activities,  may  expend  significant  computational  resources.  This  expense 
is  in  terms  of  communication  delays  waiting  for  this  information  to  arrive,  as  well 
as  the  computational  cost  of  both  providing  this  information  in  a  suitable  form 
to  other  agents  and  processing  this  information  to  make  local  control  decisions. 
Thus,  for  specific  problem-solving  situations,  due  to  the  inherent  uncertainty  in 
agents’  activities  and  the  cost  of  meta-level  processing,  it  may  not  be  worthwhile 
to  acquire  a  complete  view  of  other  agents’  activities,  and  thus  some  level  of 
non-coherent  activity  may  be  the  optimal  coordination  strategy  [11]. 

For  example,  coordination  to  avoid  redundant  activities  may  be  unnecessary 
if  processing  resources  are  not  overloaded  and  if  the  communication  channel  is 
neither  expensive  nor  overloaded.  In  this  case,  local  problem  solving  is  done  more 
efficiently  where  there  is  no  additional  overhead  for  coordination.  If  a  coordina¬ 
tion  strategy  can  be  developed  whose  costs  can  be  varied  depending  upon  the 
amount  and  type  of  non-local  information  it  uses  to  make  coordination  decisions, 
then  it  seems  that  only  a  selected,  possibly  situation-specific,  view  of  other  agents’ 
activities  is  necessary.  The  obvious  next  question  is  how  to  determine  what  the 
appropriate  situation-specific  view  is  and  what  type  of  coordination  rules  should 
be  used  in  the  situation.  It  is  our  hypothesis  that  for  many  multi-agent  apphca- 
tions,  especially  those  operating  in  complex,  open  and  possibly  evolving  environ¬ 
ments,  it  is  very  difficult  or  impossible  for  the  designer  of  a  system  to  a  priori 
anticipate  all  the  problem-solving  contexts  and  exactly  which  information  and 
what  coordination  strategy  for  each  context  will  be  most  cost-effective.  Thus,  in 
this  paper,  we  propose  integrating  into  each  agent  a  distributed  learning  com¬ 
ponent  that  agents  can  use  to  acquire  through  experience  which  information  is 
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most  effective  for  a  specific  situation  and  how  to  exploit  this  information. 

In  the  remainder  of  the  paper,  we  develop  this  distributed  learning  compo¬ 
nent  in  detail,  based  on  EBL  techniques  [5,  12]  using  a  Hmited  domain  model  and 
inductive  techniques  such  as  compzirative  aneilysis  [9],  and  discuss  the  implemen¬ 
tation  of  these  ideas  in  areal  distributed  problem-solving  system,  LODES  [15, 16], 
which  performs  internetwork  diagnosis. 

2  An  Example  Problem 

The  LODES  network  diagnosis  system  observes  message  traffic  on  the  network 
in  order  to  detect  and  analyze  situations  which  indicate  a  hardw2ire  or  software 
problem  in  the  network.  The  LODES  system  becomes  a  multi-agent  system  when 
the  network  is  implemented  as  multiple  communication  segments.  In  this  case,  a 
separate  LODES  agent  is  placed  on  each  segment  and  is  responsible  for  monitor¬ 
ing  traffic  on  that  segment  of  the  network  and  diagnosing  any  problems  that  are 
recognized.  LODES  agents  start  out  with  the  assumption  that  there  is  no  need 
to  expUcitly  coordinate  with  other  LODES  agents  on  different  segments  of  the 
network  (i.e.,  their  subjective  views  used  for  coordination  are  based  only  on  their 
local  task  structure).  The  only  interaction  among  agents  is  an  occasional  request 
for  a  diagnostic  task  that  can  be  performed  by  only  the  LODES  agent  that  is 
located  on  a  specific  network  segment  and  the  sharing  of  information  about  the 
network  configuration  and  the  final  results  of  diagnosis.  The  basic  assumption 
behind  this  lack  of  coordination  is  that  there  are  sufficient  communication  and 
computational  resources  available  on  the  network  to  sustain  a  certain  level  of 
non-coherent  behavior  (e.g.,  two  agents  diagnosing  the  same  problem).  This  as¬ 
sumption  is  appropriate  for  diagnostic  activities  in  most  network  environments 
and  was  the  basis  for  how  the  system  was  originally  implemented.  However,  it  is 
not  a  vahd  assumption,  for  example,  in  network  environments  where  diagnostic 
activities  generate  a  lot  of  message  traffic,  and  where  there  axe  multiple  agents 
performing  redundant  diagnosis  and  the  cost  of  communication  is  significant. 
Consider  the  following  situation  in  the  network  environment  as  shown  in  Fig.  1, 
where  LI  to  L7  are  LODES  agents,  Netl  to  Net?  are  network  segments,  and  Net5 
and  Net6  are  connected  with  a  narrow-bandwidth  hne.  Suppose  a  host,  on 
Netl,  sends  a  broadcast  to  aU  hosts  on  Net?,  but  most  of  the  hosts  cannot  un¬ 
derstand  that  protocol.  Upon  receiving  the  broadcast,  they  simultaneously  send 
back  error  packets  in  order  to  inform  that  they  have  discarded  its  broadcast 
packet  (this  problem  is  called  the  first  problem).  In  this  situation,  multiple  agents 
LI  ...  L?  will  be  concurrently  diagnosing  the  same  problem  since  all  wiU  see  the 
messages  indicating  the  discarding  of  the  broadcast  packets.  In  responding  to 
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Figure  1:  Network  Environment  for  the  Example  Problem 

this  problematic  situation,  each  LODES  agent  may  independently  send  diagnos¬ 
tic  test  packets  through  the  network  to  understand  its  cause,  which  could  be  for  a 
variety  of  reasons.  This  redundant  diagnostic  message  traffic  may  in  turn  lead  to 
another  problematic  situation  if  the  network  environment  has  network  segments 
which  are  implemented  as  narrow-bandwidth  lines,  or  if  a  tariff  is  associated  with 
each  message,  or  if  there  are  other  problems  occuring  in  the  network  that  also 
need  to  be  diagnosed  and  computational  resources  are  being  used  redundantly. 
This  is  a  problematic  situation  since  it  indicates  the  potential  overloading  of  an 
expensive  or  scarce  resource.  This  secondary  problem,  which  is  directly  caused  by 
LODES  agents  activities,  can  be  attributed  to  the  lack  of  effective  coordination 
among  agents. 

Note  that  one  of  the  interesting  aspects  of  our  diagnosis  application  is  that, 
in  this  example  problem,  the  LODES  agents,  in  their  monitoring  of  the  network, 
wiU  detect  the  secondary  problem.  In  this  case,  agents  L5  and  L6  will  decide 
that  the  secondary  problem  is  not  tolerable  because  they  know  the  existence  of 
the  narrow-bandwidth  line.  The  existence  of -this  problem  and  the  fact  that  it 
was  caused  by  the  LODES  agents  themselves  wiU  be  the  trigger  for  the  learning 
component  to  be  invoked. 

3  The  Learning  Framework 

Our  approach  to  learning  coordination  rules  can  be  described  in  the  coordina¬ 
tion  model  proposed  by  Decker  and  Lesser  [3,  4]  based  on  generic  coordination 
relationships.  In  this  model,  each  agent  makes  scheduUng  decisions  based  on  a 
subjective  view  of  its  own  and  other  agents’  task  structures  and  the  relation¬ 
ships  among  these  tasks  (such  as  enable-^  facilitate-^  and  support-relations^)  and 

This  is  a  new  relationship  that  was  not  in  the  original  formulation  discussed  in  [3]  and  relates 
to  how  a  task  in  one  agent  can  effect  the  subjective  view  of  another  agent’s  task  structure  by 
changing  the  importance  rating  of  its  tasks.  This  rating  change  can,  in  turn,  cause  the  agent 
to  choose  one  task  over  another  for  execution. 
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resource  usage  patterns  (such  as  e-relation).  These  task  and  resource  rela¬ 
tionships  are  not  only  qualitative  but  also  have  quantitative  characteristics.  In 
certain  situations  this  subjective  view  will  lead  to  inefFective/inappropriate  ac¬ 
tions  because  certain  non-local  interrelationships  have  not  been  appropriately 
taken  into  account;  this  can  occur  because  the  limited  subjective  agent  view  does 
not  include  an  important  relationship  or  the  quantitative  character  of  the  re¬ 
lationship  is  incorrectly  inferred  because  of  the  use  of  a  default  assumption  or 
out-of-date/incorrect  information. 

Before  presenting  our  learning  framework,  we  introduce  the  following  termi¬ 
nology.  During  problem  solving,  an  agent  selects  a  goal  and,  to  achieve  this  goal, 
a  task  is  executed.  A  task  consists  of  a  partially  ordered  set  of  subtasks.  Each 
subtask  also  has  an  associated  subgoaL  The  execution  of  a  task  means  that  all 
the  subtasks  are  executed  in  a  manner  consistent  with  the  specified  partial  order. 
A  subtask  may  further  be  divided  into  smaller  subtasks.  The  individual  task 
that  cannot  be  divided  into  subtasks  is  called  an  operation  (in  [3],  this  is  called 
a  method),  which  must  be  an  executable  form.  There  are  usually  a  number  of 
different  tasks  that  can  be  used  to  achieve  a  goal.  The  LODES  planner  specifies 
the  alternative  task  structures  to  achieve  a  specific  diagnostic  gocil  and  then  se¬ 
lects  an  appropriate  task  among  them.  This  selected  task  is  called  a  plan.  The 
local  scheduler  selects  an  operation  that  should  be  executed  next  based  on  the 
plan.  During  this  execution,  communication  activities  such  as  transmitting  an 
operation’s  outcome  or  requesting  information  may  occur. 

The  knowledge  used  by  our  learning  framework  includes  a  heuristic  collection 
of  rules  and  procedures  for  1)  recognizing  situations  where  there  is  costly  incoher¬ 
ent  behavior^,  2)  identifying  control  decisions  that  lead  to  this  behavior,  and  3) 
modifying  these  control  decisions  or  replacing  them  with  new  decision  processes 
that  rectify  the  inappropriate  control.  When  an  undesirable  situation  is  observed, 
the  learning  component  first  identifies  the  mainstream  tasks  and  messages,  which 
contributed  to  achieving  the  final  result  of  the  LODES  diagnostic  process.  It 
then  locates  in  the  trace  which  tasks  induced  the  observed  undesirable  situation. 
Next,  agents  rebuild  the  situation  model  they  had  locally  in  the  diagnostic  pro¬ 
cess.  These  models  are  exchanged  and  agents  generate  a  more  comprehensive 
model  about  the  observed  situation.  Agents  then  reproduce  the  inference  pro¬ 
cess  based  on  this  model.  Two  cases  are  possible  depending  on  the  result  of  the 

^This  is  done  by  the  meta-level  controller  within  an  agent  or  by  an  external  monitor  for 
specific  (shared)  resources  (such  as  network  resource  or  database).  What  kinds  of  events  are  de¬ 
tected  are  described  in  [17].  It  is  also  assumed  that  an  agent  locally  records  an  abstracted  trace 
of  its  recent  problem-solving  actions  which  can  be  reviewed  on-line  by  the  learning  component. 
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reproduction.  The  first  case,  which  we  will  call  a  lack- of -information  problem, 
can  be  solved  in  a  very  local  way  by  just  choosing  an  alternative  task  to  satisfy 
the  given  diagnostic  goals.  In  this  case,  control  rules  are  added  to  the  planner  to 
choose  the  alternative  task  for  satisfying  the  given  goal  in  this  particular  situa¬ 
tion.  In  essence,  the  learning  component  determines  what  aspects  of  the  task  and 
resource  structure,  which  if  taken  into  account  in  the  scheduling  of  local  activities 
would  lead  to  more  coordinated  behavior;  it  then  creates  the  rule  to  acquire  or 
calculate  this  data  (i.e.,  the  agent ^s  subjective  view  of  the  relationship  between 
its  activities  and  those  of  other  agents  is  extended).  The  second  case,  which  we 
wiU  call  an  incorrect- control  ^Tohlem,  is  not  solved  solely  by  making  existing  rules 
more  context  sensitive  with  respect  to  the  external  agent  environment.  In  this 
case,  control  rules  for  a  new  situation-specific  scheduling  strategy  are  added  to 
increase  the  ratings  of  appropriate  tasks  or  alter  the  coordination  strategy  used 
by  agents  in  this  specific  case. 

This  basic  paradigm  of  monitoring,  analysis  and  replanning  (learning,  in  this 
case)  is  very  similar  to  the  one  laid  out  but  never  implemented  for  use  in  or¬ 
ganizational  self-design  [2]  and  a  partially  implemented  approach  to  meta-level 
control  for  a  single-agent  system  [8]. 

4  Learning  Steps 

This  section  details  the  steps  of  the  learning  process  using  the  example  problem 
discussed  previously. 

4.1  Recording  Inference  Traces 

An  agent  must  be  able  to  record  traces  of  reasoning  for  analysis  by  the  learning 
component.  Traces  have  to  contain  data  describing  the  following  aspects  of  local 
problem  solving:  (1)  executed  tasks  and  operations;  (2)  task  relationships;  (3) 
information  communicated;  (4)  resource  usage  (if  possible);  (5)  domain  data  used; 
(6)  domain  knowledge  used;  and  (7)  control  knowledge  used.  All  this  information 
is  stored  with  a  time  stamp.  An  agent  must  be  able  to  reproduce  the  same 
decisions  and  reasoning  from  the  recorded  trace. 

4.2  Mainstream  Tasks  and  Messages 

The  first  step  in  the  learning  process  is  to  identify  the  mainstream  tasks  and 
messages.  These  can  be  obtained  by  tracing  backwards  through  task  relations 
from  the  final  result  in  the  problem-solving  trace  as  follows.  The  task  that  directly 
led  to  the  final  result  is  a  mainstream  task.  A  task  that  has  a  task  relation  with 
a  mainstream  task  is  a  mainstream  task.  A  message  that  produced  a  mainstream 
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Condition: 

execution-orderdl  T2  T3) 
mstCTi)  ,nist(T3) 
not-mst--b«tve®n(Tl  T3) 
task~pr oposod-by (T3 , Ml ) 

task-proposed-by (T2 ,Ti) 

arrive-vhen(Ml ,T2) 


msg-status(Hl)  =  Analysis-Postponed 
Then:  mess age -has -the- low-rating (Ml) 


Order  of  executions 

T1  and  73  were  the  Mainstream  Tasks . 

No  mainstream  tasks  were  executed  after  T1  and  before  T3. 
T3  was  proposed  based  on  message  Ml. 

T2  was  proposed  based  on  the  result  of  Tl. 

Ml  arrived  before  T2  was  executed. 

The  rating  of  the  message  was  so  low 
that  its  analysis  was  postponed. 


Figure  2:  An  Example  of  Learning  Heuristics 
This  rule  describes  the  situation  where  the  execution  of  a  mainstream  task  (T3)  is 
delayed  (task  T2  that  does  not  contributed  to  the  final  result  is  executed  before  T3) 
because  an  important  message  (Ml)  has  been  lowly  rated. 


task  is  a  mainstream  message.  A  task  that  produced  a  mainstream  message  is  a 
mainstream  task.  A  task  that  produced  the  content  of  a  mainstream  message  is 
a  mainstream  task.  This  process  of  computing  the  mainstream  extends  beyond  a 
single  agent’s  local  problem  solving  to  incorporate  activities  of  agents  distributed 
throughout  the  network. 

4,3  LAP  Detection 

The  second  step  of  the  learning  process  is  to  isolate  situations  in  which  the 
following  inappropriate  actions  occurred. 

(Dl)  Execution  of  unnecessary  actions;  tasks  that  do  not  belong  to  the  main¬ 
stream  set  axe  redundant  or  unnecessary  for  achieving  the  goal  of  problem 
solving  for  the  current  episode. 

(D2)  Tasks  that  were  not  executed  in  a  timely  manner;  these  are  usually  caused 
by  inappropriate  task  allocation,  delay  of  communications,  or  by  executions 
of  unnecessary  tasks. 

(D3)  Longer  than  expected  task  durations;  this  situation  is  usually  caused  by 
resource  overloads  or  by  inappropriate  scheduling  in  other  agents. 

(D4)  Redundant  mainstream  task;  for  example,  a  variable  is  defined  twice  or  its 
value  is  redundantly  sent  from  other  agents. 

(D5)  Tasks  that  induced  problematic  external  actions;  see  the  example  problem. 

These  problem-solving  situations  axe  called  learning  analysis  problems  (LAPs) 
and  indicate  situations  where  non-coherent  behavior  has  taken  place. 

For  example,  the  rule  in  Fig.  2  shows  the  situation  where  the  mainstream 
task  T3  has  not  been  executed  in  a  timely  manner  because  the  message  Ml,  which 
would  cause  the  task  T3  to  be  instantiated  and  executed,  has  been  inappropriately 
rated  causing  the  processing  of  message  Ml  to  be  delayed. 
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Figure  3:  Plans  created  during  diagnosis  of  the  example  problem 

In  the  example  problem,  the  LAP  detection  identifies  the  task(s)  in  which 
the  test  packets  were  sent  by  analyzing  the  trace  of  the  first  diagnostic  problem 
solving  (e.g.,  an  example  of  a  D5  situation).  Fig.  3  illustrates  the  mainstream 
tasks.  Agents  can  identify  that  the  observed  test  packets  were  sent  in  the  plan 
P3  named  “Get-RTT-between-Both-Ends”  in  this  step. 

4.4  Task  Structure  Analysis 

The  third  step  of  the  learning  process  involves  agents  building  a  more  compre¬ 
hensive  and  objective  model  to  describe  the  detected  LAP. 

(Al)  Each  agent  creates  the  (subjective)  model  about  the  task  structure  and 
resource  usages  when  the  LAP  occurred. 

(A2)  The  agent  identifies  the  reason  why  the  task  causing  the  LAP  was  selected. 
The  model  created  in  Al  can  be  thought  of  as  subjective  because  it  is 
based  on  what  information  an  agent  used  in  its  deliberations — not  what 
information  was  actually  available. 

(A3)  AU  involved  agents  exchange  their  subjective  model  and  generate  more 
comprehensive  models.  These  models  include  goals,  scheduled  tasks,  in¬ 
ference  states  and  resource  usages  of  all  agents  as  well  as  other  observed 
domain  data.  These  models  are  furthermore  extended  by  adding  results  of 
non-mainstream  tasks.  These  results  are  also  exchanged. 


(A4)  The  agent  that  executed  the  task  causing  the  LAP  reproduces  the  inference 
using  the  generated  comprehensive  model.  Depending  on  whether  or  not 
the  agent  selects  the  same  task,  the  LAP  can  be  seen  as  being  caused  by 
either  the  lack-of-information  or  the  incorrect-control  problem  as  described 
in  Section  3.  If  it  is  the  lack-of-information  problem,  the  agent  puts  marks 
on  pieces  of  the  comprehensive  model  that  are  used  to  select  the  appropriate 
task. 

In  the  example  problem,  agents  create  a  comprehensive  model  about  the  sit¬ 
uation  in  which  the  plan  “Get-RTT-between-Both-Ends”  was  selected.  Agents 
select  this  plan  to  estimate  the  network  load  and  bandwidth  by  gathering  statis¬ 
tics  on  the  round-trip  time  (RTT)  of  a  number  of  test  packets  sent  into  the 
network.  This  plan,  which  is  not  the  optimal  way  to  gather  this  statistic,  pro¬ 
duces  only  an  approximate  value  for  RTT.  A  better  plan  would  have  been  to  use 
the  Simple  Network  Management  Protocol  (SNMP),  which  is  a  communication 
protocol  designed  for  acquiring  network  management  data  [1],  but  this  was  not 
possible  in  the  example  since  the  adjacent  network  routers  did  not  implement  it. 
An  agent,  in  order  to  understand  what  other  tasks  agents  are  executing,  must 
know  that  local  routers  do  not  use  the  SNMP,  and  that  all  agents  along  the  route 
from  Net  7  to  Net  1  will  be  performing  the  same  diagnoses.  Agents  can  then  un¬ 
derstand  that  the  identical  task  was  selected  in  other  agents.  This  view  of  which 
tasks  were  to  be  executed  in  other  agents  was  not  present  and  thus  responsible  for 
the  LAP  occurring;  this  is  one  of  the  important  parts  of  the  enhanced  subjective 
view  necessary  for  achieving  more  coordinated  activities  (see  Fig.  4  (a)  and  (b)). 

4.5  Coordination  Control  Modification 

The  fourth  step  involves  adding  or  modifying  control  rules  for  coordination  based 
on  the  analysis  of  the  model  developed  in  the  previous  step.  The  approach  to 
modification  depends  on  the  types  of  problems  identified  in  the  previous  step. 
A  lack-of-information  problem  occurs  when  an  agent  has  the  control  rule  to  se¬ 
lect  another  appropriate  task  but  (1)  lacks  sufficient  time  to  acquire  and  analyze 
necessary  information  or  (2)  lacks  rules  for  acquiring  necessary  information.  In 
the  former  case,  EBL  techniques  can  be  used  to  develop  a  more  efficient  ver¬ 
sion  of  the  control  rule  that  meets  the  timing  constraints.  In  the  latter  case, 
rules  for  acquiring  the  missing  information  are  added.  These  types  of  rules  may 
include  requesting/sending  data  and  changing  the  order  of  communication/task 
executions. 

For  an  incorrect-control  problem,  agents  have  to  generate  a  new  rule  or  mod¬ 
ify  an  existing  rule  in  order  to  reach  the  alternative  control  using  the  following 
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(c)  The  learned  rule 

IF  Min~of~MaX“Thrput  <  64Kb  ;;  that  is,  the  type  of  resource  is  strictly  restricted 

and  the  plan  '’Get-RTT-between-Both-Ends'* 

is  selected  in  more  than  two  agents  ..  jg  model  of  the  current  situation  is  identical  to  (b) 
Then  '  ^ 

Share  the  result  of  the  plan  "Get-RTT-between-Both-Ends." 

Figure  4:  Models  for  Problem  Solving:  (a)  is  the  model  created  in  LI  before 
learning.  This  model  does  not  include  the  resource  type  and  other  agent’s  intended 
tasks,  (b)  is  the  model  that  will  be  created  in  LI  ...  L7  after  lecirning,  and  (c)  is  the 
rule  finally  derived  by  learning.  The  model  (b)  includes  important  data  involved 
in  coordination  activities.  This  model  is  created  to  apply  the  rule  (c);  To  check  its 
“if-part,”  first  the  value  of  “Min-of-Max-Thrput”  is  c€dculated  then  other  agents’ 
planned  tasks  are  gathered.  Then  the  learned  coordination  control  takes  place  when 
the  acquired  model  is  identical  to  the  model  (b). 


learning  knowledge: 

(Ml)  Changing  the  rating  of  the  specific:  goal  and  message  so  that  the 
appropriate  task  is  selected  in  the  specific  situation.  For  example,  the  LAP 
found  by  the  rule  in  Fig.  2  will  be  fixed  by  increasing  the  rating  of  the 
message  Ml  by  its  sender  or  receiver. 

(M2)  Changing  the  order  of  operations  and  communications  since  some 
operations  are  often  locally  commutative.  The  purpose  of  this  change  is 
to  speed  up  the  execution  of  another  agent  which  is  waiting  for  the  result 
of  an  operation  that  can  be  done  earlier  in  the  local  agent  without  any 
consequence. 

(M3)  Allocating  a  task  to  another  idle  agent  in  order  to  take  advantage  of 
concurrent  execution. 

(M4)  Using  results  calculated  by  another  agent  in  order  to  avoid  redundant 
actions.  In  this  situation,  one  agent  executes  the  corresponding  task  and 
others  wait  for  the  result. 
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(M5)  Selecting  another  (sub) task  that  is  expected  to  produce  the  same  out¬ 
come  but  does  not  contain  the  operation  that  caused  the  LAP. 

In  the  example  problem,  if  agents  have  the  control  rule  that  can  derive  ap¬ 
propriate  (at  least,  different)  coordinated  actions  from  the  comprehensive  model, 
only  the  rule  to  acquire  information  about  the  resource  type  and  usages  by  other 
agents  is  generated  (lack-of-information  problem).  It  is  possible,  however,  that 
agents  cannot  select  another  task  based  on  the  comprehensive  model.  This  is 
because  the  model  does  not  include  the  resource  type  information  and/or  agents 
do  not  have  the  control  rules  to  select  another  task  (incorrect-control  problem). 
In  such  a  case,  agents  modify  their  control  by  applying  (M4)  so  that  the  LODES 
agents  performing  the  same  diagnostic  activity  can  share  the  results  of  the  plan 
executed  by  a  single  agent  (see  Fig.  4). 

4.6  Situation  Identification 

Since  the  learned  rules  are  situation-specific,  agents  have  to  identify  the  situation 
in  which  the  learned  rule  should  be  apphed.  The  purpose  of  this  step  is  to  create 
the  part  of  the  rule  for  identifying  the  situation.  The  situation  must  be  identified 
based  on  the  comprehensive  model  constructed  in  step  four.  However,  a  complete 
version  of  this  comprehensive  model  is  usually  over-specific;  we  think  that  only  a 
pzirt  of  this  model  is  essential  to  characterize  the  situation  where  the  derived  rule 
should  be  applied.  The  question  we  then  face  is  what  part  of  the  comprehensive 
model  is  necessary  to  uniquely  identify  the  problematic  situation. 

An  inductive  method  is  introduced  to  solve  this  identification  problem.  First, 
the  past  traces  and  other  agents’  traces  containing  the  same  LAP  as  well  as  the 
last  reasoning  trace  are  gathered®,  then  these  traces  are  divided  into  two  types  of 
instances — positive  and  negative  instances.  For  example,  if  a  LAP  is  identified 
by  the  rule  in  Fig.  2,  the  traces  include  the  situation  where  Tl,  T2,  T3  and  Ml 
cire  bound  to  the  same  veJues  are  gathered.  If  the  same  LAPs  are  developed, 
the  traces  are  positive;  if  no  or  different  LAP  are  developed,  they  are  negative. 
When  a  veiriable  used  in  the  different  types  of  instances  has  the  same  value  or 
when  a  variable  used  in  the  same  type  of  instances  has  different  values,  we  can 
conclude  that  the  variable  is  not  necessciry  for  distinguishing  the  situation  from 
others.  This  process  of  comparison  is  called  comparative  analysis  (CA)  [8]. 

In  this  example,  however,  since  LI  ...  L7  diagnose  the  same  secondary  prob¬ 
lem  in  this  example,  agents  can  identify  the  situation  by  comparing  their  traces 

®This  approach  assumes  that  we  have  stored  some  amount  of  history  in  the  agent  about 
past  problem-solving  experiences  in  a  way  to  facilitate  such  analysis  [17]. 
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Table  1  (a):  Comparative  Analysis  (L5  and  L6) 


Variables 

values  in  L5 

values  in  L6 

Adjacent  networks 

Net4,  Net6 

Nets  and  Net? 

eliminated 

Adjacent  routers 

R4,  R5 

R6,  R7 

eliminated 

End-nodes 

(LI,  L7) 

Src-MAC 

xx:xx:xx:f:2a:3b 

Dst-MAC 

xx:xx:xx:f:2a:3a 

xx:xx:xx:0;2f:7e 

eliminated 

Type-of-Storm 

ICMP-Echoes 

ICMP-Echoes 

MaxThrputl 

10,000,000 

64,000 

eliminated 

MaxThrput2 

64,000 

10,000,000 

eliminated 

Min-of-MaxThrput 
(Quantitative  Measure) 

64,000 

64,000 

64,000 

(max  of  L5  and  L6’s 
values) 

Observed-Number-of- 

Echoes 

(Quantitative  Measure) 

68 

61 

61 

(min  of  L5  and  L6’s 
values) 

Current-Traffic 

low 

low 

Table  1  (b):  Comparative  Analysis  (L4  and  L5) 


Variables 

values  in  L4 

values  from  the 
previous  CA 

End-nodes 

(Ll,  L7) 

(Ll,  L7) 

eliminated 

Type-of-Storm 

ICMP-Echoes 

ICMP-Echoes 

eliminated 

Min-of-MaxThrput 

(Quantitative 

Measure) 

10,000,000 

64,000 

Observed-Number-of- 

Echoes 

(Quantitative  Measure) 

68 

61 

eliminated 

(almost 

identical) 

Current- Traffic 

low 

low 

eliminated 

We  assume  that  L5  claimed  the  secondary  problem.  L6  also  claimed  the  same  problem 
thus  its  trace  is  the  positive  instance.  L4  did  not  so  its  trace  is  negative.  All  variables  are 
used  in  the  diagnosis  of  the  secondary  problem.  Only  the  variable  Min-of-MaxThrput  that 
indicates  the  minimum  of  the  local  throughput  to  adjacent  network  segments  remains. 


of  reasonings  about  the  secondary  problem.  Although  LI  to  L7  observe  the  sec¬ 
ondary  problem,  only  L5  and  L6  concluded  that  the  problem  was  not  tolerable, 
because  they  knew  the  existence  of  a  naxrow-bandwidth  line.  This  difference  is 
expressed  by  the  variable  Min-of-MaxThrput;  other  variables  are  ehminated  by 
the  CA  (cf.  Table  1).  The  results  finally  obtained  through  learning  axe  illustrated 
in  Fig.  4. 

Note  that  after  this  learning,  an  agent’s  action  may  still  cause  the  same  or 
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another  problem.  For  example,  the  situation  where  the  learned  rules  should  be 
applied  cannot  correctly  be  identified  because  of  insufficient  instances.  Thus  the 
LAP  will  occur  again.  In  another  case,  the  appropriate  inference  strategy  will 
change  over  time  in  a  gradually  evolving  environment.  In  both  cases,  however, 
by  applying  this  learning  iteratively,  agents  can  incrementally  converge  on  the 
correct  rules. 

5  Empirical  Results 

AU  aspects  of  the  learning  component — except  the  EBL  (we  assume  that  a  con¬ 
ventional  EBL  package  can  be  imported  in  the  future) — have  been  implemented  in 
the  LODES  system.  The  learned  control  plan  for  the  example  problem  described 
in  this  paper  represents  the  output  of  our  implemented  learning  component.  The 
CPU  time  and  the  elapsed  time  of  the  learning  process  for  this  example  cire  ap¬ 
proximately  5.12  seconds  and  102.5  seconds  in  C  and  LISP  (interpreter)  on  Sun 
machines  (SpeircStation  1  (SSI),  SSl-f  and  SS2)^.  The  difference  between  the 
CPU  time  and  the  elapsed  time  is  caused  by  communications  and  synchroniza¬ 
tion.  The  modified  agent  control  resulted  in  a  decrease  of  approximately  10%  in 
the  time  for  aU  agents  to  arrive  at  a  diagnosis  of  the  first  problem  because  of  ad¬ 
ditional  coordination  activities.  The  total  number  of  communication  packets  sent 
among  agents  is  almost  identical  to  the  amount  before  the  learning,  but  those 
that  passed  through  the  slow  line  are  approximately  half.  We  want  to  empha¬ 
size  that  quahty  inference  is  more  important  than  the  efficiency  in  this  example 
problem.  Before  the  learning  is  applied,  agents  almost  bhndly  create  their  plans 
based  on  the  subjective  view  illustrated  in  Fig.  4  (a),  which  did  not  include  the 
view  of  non-local  agents;  thus,  the  environment  caused  the  secondary  problem. 
The  learning  enables  the  agents  to  create  their  plans  based  on  more  accurate  and 
appropriate  views  about  agent  networks,  task  relations  and  their  environments 
as  illustrated  in  Fig.  4  (b).  An  efficiency-oriented  example  is  described  in  [17]. 

6  Conclusion 

Although  coordination  is  an  essential  technique  for  cooperative  distributed  prob¬ 
lem  solving,  there  needs  to  be  a  balance  between  too  much  or  too  Httle  coordi¬ 
nation  since  both  can  degrade  overall  system  performance.  The  problem  is  how 
to  create  for  a  specific  situation  the  appropriate  model  of  the  network  state,  and 
the  necessary  processing  of  this  information  to  arrive  at  an  acceptable  control 
decision.  We  feel  it  is  an  impossible  task  for  the  system  designers,  at  design  time, 

^The  CPU  time  can  be  improved  by  compiling  and/or  optimizing  the  LISP  program. 
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to  choose  this  appropriate  balance  in  all  situations,  especially  for  systems  oper¬ 
ating  in  open  and  evolving  environments.  This  paper  has  discussed  a  learning 
method  for  acquiring  situation-specific  coordination  control  rules.  The  method 
enables  the  system  to  avoid  a  previously  recognized  non-coherent  situation  by 
modifying /extending  local  control  strategies  to  be  more  situation-specific  so  as 
to  have  a  more  enhanced  view  of  the  task  relations  and  the  environment.  This 
approach  to  learning  in  a  multiagent  system  is  quite  different  in  character  from 
reinforcement  learning  approaches  or  statistical  approaches  to  learning  rules  and 
peirameters  done  by  other  researchers  [10,  13,  14]. 
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